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Abstract. Quasi-periodic motions on invariant tori of an integrable system of 
dimension smaller than half the phase space dimension may continue to exists after 
small perturbations. The parametric equations of the invariant tori can often be 
computed as formal power series in the perturbation parameter and can be given 
a meaning via resummations. Here we prove that, for a class of elliptic tori, a 
resummation algorithm can be devised and proved to be convergent, thus extending 
to such lower-dimensional invariant tori the methods employed to prove convergence 
of the Lindstedt series either for the maximal (i.e. KAM) tori or for the hyperbolic 
lower- dimensional invariant tori. 



1. Introduction 

Quasi-integrable analytic Hamiltonian systems are described by Hamiltonians of the form H = 
H Q {l)+eHx{<p, I), where (tp, I) e T x A, with A an open domain in R , are conjugate coordinates 
(called angle-action variables), the functions Tlo and Tli are analytic in their arguments, and e is 
a small real parameter. We shall consider for simplicity only Hamiltonians of the form 

W = il- 1 + £/(*>), (1.1) 

where • denotes the inner product in M. d . 

Kolmogorov's theorem (KAM theorem) yields, for e small enough, the existence of many invariant 
tori for Hamiltonian systems of the form (1.1): such tori can be parameterized by the corresponding 
rotation vectors, at least if the latter satisfy some Diophantine conditions. On the other hand 
Poincare's theorem states the existence of periodic orbits, which can be parameterized by rotation 
vectors satisfying d — 1 resonance conditions (so that after a simple linear canonical map one can 
assume that the rotation vector is (lui, 0, 0, . . . , 0)). 

A natural question is what happens of the invariant tori corresponding, in absence of pertur- 
bations, to rotation vectors satisfying s resonance conditions, with 1 < s < d — 2. If we fix 
the rotation vector as (w,0) = (u>±, . . . , <j r , 0, . . . , 0) and parameterize the invariant torus for 
e = with the action value 1 = then, after translating the origin in R d by (u>, 0) and set- 
ting I = (A, B) £ 8 r x R s , tp = (ct,/3) S T r x T , the Hamiltonian (1.1) becomes 

H = u- A+^A-A+iB-B + ef (a, (3), (1.2) 
where (a, A) G T r x R r and (/3, B) £ T s x R s are conjugate variables, with r + s = d, and • denotes 
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2: Degenerate elliptic resonances 

the inner product both in R r and in M s . Here we impose that a; is a vector in W satisfying 

\u>-v\>C Q \v\- Ta ViveZ r \{0}, (1.3) 

with Co > and to > r — 1, which is called the Diophantine condition] we shall define by D Tq (Cq) 
the set of rotation vectors in W satisfying (1.3). We also write 

f(cx,/3)=J2^ a fA^- (1-4) 

We shall suppose that / is analytic in a strip around the real axis of the variables a, f3, so that 
there exist constants F Q , F 1; k such that \&Lf v {0)\ < q\F F^e- K °^ for all v eT and all (3 G T . 

There are quite a few results on the above problem, essentially solved, under the assumptions of 
Theorem 1 below, in Ref. [JLZ], and on closely related problems. We summarize our understanding 
of the existing results in Appendix Al. 

The equations of motion for the system (1.2), written in terms of the angle variables alone, are 

& = -£d a f(a,f3), (3 = -edpf(a,P), (1.5) 

so that, once a solution of (1.5) is found, the action variables are immediately obtained by a simple 
differentiation: A = a — u>, B = (3. 

We look for solutions of (1.5), for e ^ 0, conjugated to the free solution (c* + wt, /3o, 0, 0), i.e. 
we look for solutions of the form 

a(t)=^ + a(^,#,;e), (3{t) = /3 + b(^,#,;e), (1.6) 

for some functions a and b, real analytic and 27r-periodic in ib G T r , such that the motion in the 
variable ip is governed by the equation xj> = oj. We shall prove the following result. 

Theorem 1. Consider the Hamiltonian (1.2), with u> G D ro (Co) and f analytic and periodic in 
both variables. Suppose f3 to be such that 

dpf {(3 ) = 0, (1.7) 

and assume that the eigenvalues oi, . . . , a s of the matrix <9g./o(A)) are pairwise distinct and positive, 
i.e. for some constant a > one has aj, aj — a-i > a > for all j > i = 1, . . . , s. 
Then there exist a constant e > and a set £ C (0,e) such that the following holds. 

(1) For all e G £ there are solutions of (1.5) of the form (1.6), where the two functions a(ip , /3o; s) 
and b(tp, /3o; e) are real analytic and 2-n -periodic in the variables tp G T r . 

(ii) The relative Lebesgue measure of £ n (0, e) with respect to (0, e) tends to 1 for e — > 0. 
(Hi) The functions a, b can be extended to Lipschitz functions of e,ip in [0,e] x T r . 

Remarks. (1) From the literature one might expect that the non-resonance condition on the 
eigenvalues of 9g/o(/3o) could be avoided; see Appendix Al. 

(2) The case of negative e was dealt with in Ref. [GG], with techniques close to the ones introduced 
here, and it corresponds to the case of hyperbolic tori. 

(3) The case of mixed stationarity, i.e. det <9|/o(/3o) ^ and eigenvalues of d^fo(f3 ) of mixed 
signs (with non-degeneracy of the positive ones), can be treated in exactly the same way discussed 
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3: Degenerate elliptic resonances 

in this paper and the above result extends to this case; cf. Theorem 2 in Section 7. 
(4) For e ^ £ the smooth extension in (iii) does not represent parametric equations of invariant 
tori: it just says that their values in the physically interesting set £ (which turns out to have 
dense complement in [0,e]) can be smoothly interpolated in e. Such (non-unique) extensions are 
commonly used for interpolation purposes and are called Whitney extensions. 

The novelty and the purpose of our work is the development of a method of proof based on the 
existence of a formal power series expression for the functions (a, b) and its multiscale analysis 
producing a rearrangement of its terms, involving summing many divergent series, which turns it 
into an absolutely convergent series. 

The paper is organized as follows. In Section 2 we recall the basic formalism, following Ref. 
[GG] , and in Section 3 we give a simple example of resummation. 

In Section 4 we set up terminology and discuss heuristically the ideas governing our resummations, 
by explaining why they have to be performed by a multiscale analysis of the series (which we call 
"Lindstedt series") representing a formal expansion of the quasi-periodic motions in powers of e. 

The singularities are first "probed" down to a scale in which possible resonances between the 
proper frequencies, i.e. the components of u>, and the normal frequencies, i.e. the square roots 
of the eigenvalues of ed 2 ^fo{(3 ), arc still irrelevant. The analysis of such singularities leads to 
what we call non-resonant or high frequency resummations, which can be treated by the method 
of Rcf. [GG], i.e. of the hyperbolic case, in which no resonances at all were possible between 
proper frequencies and normal frequencies (simply because, for the Hamiltonian (1.2) the latter 
did not exist) . Further probing of the singularities leads to what we call the resonant (or infrared) 
resummations: the analysis is more elaborated and it requires new ideas, obtained by combining 
the ideas in Ref. [GG] with the ones introduced in Ref. [Ge]. 

In Section 5 we discuss the non-resonant resummations while the new infrared resummations 
are studied in Section 6 where a "fully renormalized series" is obtained, i.e. a resummation of the 
series defining the formal expansion of the quasi-periodic solution (1.6) of the equations of motion 
(1.5), which we prove to be absolutely convergent. 

The paper is a self-contained discussion of the construction and of the convergence of the re- 
summed series. This includes a self-contained description of the well-known formal series, [JLZ], 
[GG] . Once this is achieved one has to check that the defined functions do actually represent para- 
metric equations of invariant tori: for this we follow, in Appendix A5, the analysis of Rcfs. [GG] 
and [Ge]. 



We look for a formal power series expansion (in e) of the parametric equations h = (a, b) of the 
invariant torus close to the torus a = ip,f3 = f3 



where we have not explicitly written the dependence on /3 . The power series is easy to derive, 
see for instance Ref. [GG]: however its convergence turns out to be substantially harder than the 
convergence proof of the Lindstedt series for the maximal KAM tori. The series constructed below 



2. Tree formalism 
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for our problem, which we still call "Lindstedt series" , is naturally described in terms of trees. The 

Ik) 

coefficients ' can be computed as sums of "values" that we attribute to trees whose nodes and 
lines carry a few labels, which we call "decorated trees" . 

The formalism to define trees, decorations and values has been described many times and used 
in the proof of several stability results in Hamiltonian mechanics. Usage of graphical tools based 
on trees in the context of KAM theory has been advocated recently in the literature as an inter- 
pretation of Ref. [El]; see for instance Refs. [Ga], [GG], [BGGM] and [BaG]. 



root 




On 

Figure 1. A tree 9 with 12 nodes; one has p Do = 2,poi = 2,p„ 2 = 3,p D3 = 
2)Pt> 4 = 2. The length of the lines should be the same but it is drawn of arbitrary 
size. The endnodes Di, i = 5, . . . , 11 can be either nodes or leaves of the tree. The 
separated line illustrates the way to think of the label r) = (7', 7). 

A tree 9 (see Fig. 1) is defined as a partially ordered set of points, connected by oriented lines. 
The lines are consistently oriented toward the root, which is the leftmost point t; the line entering 
the root is called the root line. If a line I connects two points Di, O2 and is oriented from O2 to Oi, 

def def 

we say that O2 ~< t>i and we shall write D 2 = th and £$ 2 = £; we shall say also that £ exits O2 and 
enters Oi. More generally we write C2 < Di if O2 is on the path of lines connecting Di to the root. 
The points different from the root will be called the nodes of the tree. 

Each line from t> to c' carries a pair r\ of labels r\ = (7, 7') ranging in {1, . . . , d} (marked in Fig. 
1 only on some of the lines for clarity of the drawing) . The labels 7 and 7' should be regarded as 
associated with and t>', respectively; hence with each node t) with p entering lines £\, . . . ,£ Po 
one can associate p v + 1 labels 70, 71, ... , 7 PD , with 70 = 7^ D and -fj = 7^. . Also the root line (from 
Do to the root) carries two such labels and the one associated with the final extreme of the root 
line will be called the root label. 

Fixed any ^ € 0, we shall say that the subset of 9 containing £ v as well as all nodes tv ^ and 
all lines connecting them is a subtree of 9 with root 0'; of course a subtree is a tree. 

Given a tree, with each node we associate a harmonic or mode, as called in Ref. [GG], i.e. a 
label Vx> £ IT ■ We shall denote by V{9) the set of nodes and by A(9) the set of lines. The number 
k = \V(9)\ of nodes in the tree 9, equal to the number |A(#)| of lines, will be called the order of 9. 

We call a node with one entering line and harmonic label a trivial node. 

With any line i — £ v we associate (besides the above mentioned pair r/£ = (7^,7^) of labels 
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assuming values in {1, . . . , d}) a momentum label vg G TV defined as 

vt = vi.q = "to- ( 2 - 2 ) 

ICer(») 

We shall assume that no tree contains trivial nodes with the entering line with momentum: this 
is an important restriction, as we shall see. We call degree P(9) of a tree the order of the tree 
minus the number of momentum lines, so that \V(9)\ — P(9) is their number. 

We call 6i/,fc, 7 the set of trees 9 of order k, i.e. with \V(9)\ — k nodes, and 9° k the set of trees 
of degree k, i.e. with P(9) = k. One has 6i/,fc, 7 ^ 0° k . 

Each tree 9 "decorated" by labels in the described way will have a value which is defined in terms 
of a product of several factors. 

• With each node we associate a node factor 

=nV^C8o). ( 2 - 3 ) 

j 

where the labels jj are the p$ + 1 labels associated with the extreme of the ptj lines entering the 
node and of the line exiting it, and the derivatives <9 7 , with 7 = 1, 2, . . . , r, have to be interpreted 
as factors (ii/ ) 7 . Hence F v is a tensor of rank p v + 1. 

• With each line i carrying labels r\t = (7^,7^) and momentum we associate a matrix, called 
propagator, 

H {^-viY (2.4) 
G e = -e- 1 (9|/ (/3o))-' 7 , X(7/, 7i > r), if ^ = 0, 

where x(7^,7^ > r) is 1 if both 7^ and 7^ are strictly greater than r, and otherwise. 

Given the definitions (2.3) and (2.4) define a value function Val, which with each tree 9 of order 
k associates a tree value 

vai w=^( n f *){ n g *)> ( 2 - 5 ) 

where, by the definitions, all labels ji associated with the nodes appear twice because they appear 
also in the propagators: we make in (2.5) the summation convention that repeated 7 labels asso- 
ciated with nodes and lines are summed over, with the exception of the label 7 associated with 
the root (because we do not consider it a node and the corresponding label 7 appears only once in 
(2.5)). Therefore (2.5) is a number labeled by 7 = 1, . . . , d, i.e. Val(0) is a vector. 

Remarks. (1) The trees can be drawn in various ways: we can limit the arbitrariness by demanding 
that the length of the segments representing the lines is 1 (unlike the drawings in the above figures) 
and that the angles between the lines are irrelevant. The combinatorics being very important, 
because it matters in the check of cancellations essential for the analysis, we adopt the convention 
that trees are drawn on a plane, have lines of unit length and carry an identifier label, that we call 
number label (not shown in the above figures) which distinguishes the lines from each other even 
if we ignore the other labels attached to them. Furthermore two trees that can be superposed by 
pivoting the lines merging into the same node , around itself, are considered identical. This is a 
convention which is useful for checking cancellations: however it is by no means the only possible 
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one. Others are possible and often very convenient in other respects [G3], [GM1], but in a given 
work a choice has to be made once and for all. 

(2) A line i carrying momentum is somewhat special. We could visualize the part of the tree 
preceding such lines by encircling it into a dotted circle: such a representation has been used in 
earlier papers, e.g. in Ref. [GG], calling the subtree 9e with t as root line a leaf. Here, however, 
we shall avoid using a special word for the momentum lines and the subtrees preceding them. 

(3) We can think of the propagators as matrices of the form 

Gl= fG e , aa Gt, a p\ (26) 

where Ge,aa, Ge, a p, Ge,p a and Ge t pp are r x r, r x s, s x r and s x s matrices. 

(4) The value of a tree defined above has no pole at e = if Val(0) ^ because every line with 
momentum is preceded by at least two nodes, so that the total power of e to which the value 
is proportional is always non- negative and, in fact, it is necessarily positive: we need to take into 
account that dgfo(/3o) = and that our trees contain no trivial nodes with one entering line with 
momentum. Note that Val(0) is a monomial in e of degree P(0). 

(5) In the case of maximal tori and if Val(0) =/= there are no lines with momentum for systems 
described by the Hamiltonians (1.1): indeed s = 0, see also [Ga]. In this case the number of nodes, 
i.e. the tree order, coincides with the power of e associated with the monomial in e defined by the 
tree value, i.e. with the tree degree. In general, however, the order |V(0)| of a tree can be larger 
than its degree P{6): \V{6)\ > P{6) > ±\V(9)\. 

The above definitions uniquely attribute a value to each tree. The following result states the 
existence of formal solutions to (1.5) which are conjugated to the unperturbed motion (1.4), pro- 
vided the value /3o is suitably fixed. The proof is an algebraic check which does not distinguish 
the possible signs of e and can be taken from Ref. [GG] where it is done in the case e < 0. 

Lemma 1. The Fourier transform of the power series solution h = (a, b) of (1.5) of the form (2.1) 
is obtained by writing (the definition of ©£„ follows (2.2)) 

e k h ( ^= Val W ( 2 - 7 ) 

for all v e HI , all k € N and 7 = 1, . . . , d. 

The expression (2.7) is well defined at fixed k and the sum over k gives what we call the formal 
power series solution for the equations for the parametric representation (2.1), (1.6) of the invariant 
tori. 

3. The simplest resummation 

The power series in e in (2.1) and its Fourier transform defined by the sum over k of (2.7) may be 
not convergent as a power series (as far as we know). The problem is difficult because if in (2.7) 
we replace Val(0) with |Val(0)| the series certainly diverges. 

Our aim, as stated in the introduction, is to show that nevertheless a meaning to the series 
can be given. We shall show that the tree values can be further decomposed into sums of several 
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other quantities and that the various contributions to the series can be rearranged by suitably 
collecting them into families: the sums of the contributions from each family leave us with a new 
series (no longer a power series in s) which is in fact convergent and its sum solves the problem of 
constructing the parametric representations h = (a, b), (2.1), of the invariant tori at least for all 
e G £, with £ a set with as a density point (i.e. a Lebesgue point). 

For this purpose we need to define and consider more involved trees and more involved definitions 
of their values. We begin by remarking that trees may contain trivial nodes, i.e. nodes with 
harmonic separating two lines with equal momentum u ^ 0. 

One can suppose that no tree contains trivial nodes provided we use for all lines, with momentum 
v 7^ and labels 7, 7' associated with the extremes, the new propagators 

,def t 2 „ \-l „<*£/.. .. n^ de f^f° 



g(x;e) = J (x 2 - M )~\ x^u-v, M = J e dp ((3o) ) ' (3 ' 1} 

This is a resummation of many divergent series obtained by adding the values of trees obtained 
from a tree without trivial nodes by "insertion" of an arbitrary number of trivial nodes on the 
branches with momentum u ^ 0: this requires summing series, one per branch of a tree without 
trivial nodes, which are geometric series with ratio given by the dx d matrix z = \z\ can be 

larger than 1 because the s non-zero eigenvalues ea,j, j = 1, . . . ,s, of M are unrelated to x = uj-v. 1 

Therefore replacing Yl^Lo zP by (1 — z)^ 1 is not rigorous and needs to be eventually justified. 
Certainly we must at least suppose that x 2 — M can be inverted: otherwise the values of the trees 
representing the new series might even be meaningless! (i.e. if some lines will have momentum v 
such that det(x 2 — M ) = 0). This happens for a dense set of e's and we have to exclude such e's 
by imposing conditions on the eigenvalues Aj. ]^ = ea,j, j = 1, . . . ,s, i.e. on e. 

For uniformity of notations it is convenient to assume that e is in an interval (e m i n , 4e m ; n ] related 
to the largest eigenvalue A^ ' = a s £ of M by 

Xf ^£a s G I C dc d (\C 2 ,C 2 \, C de J C 2-"o, na > o, (3.2) 

where Co is the Diophantine constant in (1.3) (fixed throughout the analysis); thus Ic is an interval 
of size 0(C 2 ) (i.e. |C 2 ). In other words we find it convenient to measure e in units of C 2 aJ x via 
an integer no. We a priori assume, for simplicity, the restrictions a s £ < Cg and £ < 1. 

To give a meaning to (x 2 — M ) _1 it would suffice to require \x 2 — £aj\ ^ for all j thereby 
excluding "only" a denumerable (dense) set of values of e, of length; however stronger conditions 
will be needed in order to analyze the convergence problems and we begin by imposing them in a 
form which will be useful later. Setting for later use A' ' (e) d = A^ ' , the conditions that we impose 
on Aj°' , i.e. on e, are that for all x = oj ■ v ^ and for all independent choices of the signs + or — 

T(x) d M mm {\\x\- , \x ± ^7) ± y/^7)\ } > 2~^)/^ (3 . 3 ) 



1 Note that since the tree lines arc numbered (i.e. they are regarded as distinct) adding p nodes on a line i changes 
the combinatorial factor fc! _1 in (2.5) into (k +p)! _1 : however the new p lines thus produced can be chosen in 
( fe p P ) wa y s anc ' ordered in p\ ways so that wc can ignore the extra number labels on I and use as combinatorial 
factor {k + p)^.- 1 ( k + p )p\ = Wr 1 . 
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for n suitably large and n suitably larger than no, see (4.2). This excludes a closed set of values 
of e in the considered interval Ic, (3.2): its measure can be estimated without difficulties. Let 

n=T + r + l, (3.4) 

be a convenient, although somewhat arbitrary, choice; then the total measure of the excluded set 
is 

< 2-^°-V/ 2 C 2 K, (3.5) 

where if is a suitable constant; see Appendix A2. Hence the measure of the complement of the 
set Sn -\ where (3.3) is verified is a small fraction of order C 1 / 2 of the measure of the interval Ic, 
whose size is proportional to C 2 , in which we let e vary at least if no is large. 

4. Resummations: semantic and heuristic considerations 

Replacing the propagators x~ 2 of the lines by (x 2 — M ) _1 we obtain a representation of the 
parametric equations h involving simpler trees (i.e. trees with no trivial nodes). The new repre- 
sentation is a series in which each term is well defined if e is in the large set £n a -i C Ic in which 
(3.3) holds. This is quite different from the original Lindstedt series in (2.7) whose terms are well 
defined for all e. 

We should also stress that the resummed series is in a sense more natural: the momentum lines 
now appear as less anomalous because their propagator is much more closely related to (x 2 — Af ) _1 - 
One can say that it is just the latter evaluated at x = with the meaningless entries (i.e. the first 
r diagonal entries) replaced by 0. Another way of saying the latter property is that lines t with 
momentum and labels 7^,7^ < r are forbidden. One should not be surprised by this fact: it is 
the generalization of the corresponding property in the case of maximal tori (r = d) in which this 
means that lines with momentum are forbidden. The latter property goes back to Poincare's 
theory of the Lindstedt series and is the key to the proof of the KAM theorem and of cancellations 
which make the formal Lindstedt series for maximal tori absolutely convergent; see Refs. [El] and 
[Ga]. However the new series is still only a formal representation because it is by no means clear 
that it is absolutely converges. 

The next natural idea is to try to establish convergence by further modifying the propagators, 
changing at the same time the trees structure, until one achieves a formal representation whose 
convergence will be "easy" to check. Once achieved a formal representation which is convergent 
we shall have to check that it really solves the equations for h. 

The modification of the trees structure will be performed by steps: at each step, labeled by an 
integer n = 0, 1, . . ., the propagators of the lines with non-zero momentum will have been modified 
acquiring labels [0], [1], ... [n — 1], or the label [> n], indicating that they are given no longer by 
(x 2 — Mo) -1 but by a matrix proportional to (x 2 — A^'- p ') _1 , if their label is [p], with p < n, or 
(with a different proportionality factor) to (x 2 — Af'-™') -1 , if their label is [> n]; here M\- p ^ are 
suitable matrices. Here and in the following the symbols [< n] and [> n] are consistently used. 
Hence [> n] does not denote the set of scales [p] with p> n, and in fact it is just a different scale; 
likewise [< n] does not "include" \p\ even if p < n. In other words one has to regard the symbols 
[< n], [n] and [> n] as unrelated symbols. This might appear unusual but it turns out to be a good 
notation for our purposes. 

The proportionality factor depends on x and contains cut-off functions which vanish unless 
x 2 — Al'- p J has smallest eigenvalue of order 0(2~ 2p Co); the cut-offs are so devised that if the 
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propagator does not vanish its denominator has a minimum size proportional to 2~ 2p and the ratio 
between its minimum and maximum values will be bounded above and below by a p-independent 
constant. No modification will be made of the propagators of the momentum lines: for uniformity 
of notation we shall attach a label [oo] to such lines. 

Considering trees with no trivial nodes in which each line carries also an extra scale label 
[oo], [0], [1], . . . [n — 1], [> n] a new formal representation of h will be obtained by assigning, to 
the trees, values defined by the same formula in (2.5), with the propagators Gt replaced by the 
new propagators, that we denote gf^ if the line I carries the label [p], with p = oo, 0, . . . , n — 1, 
and gp™' if the line carries the label [> n]. When the line I is on scale [p], with p = 0, . . . , n — 1, 
or [> n] or [oo], then the corresponding propagator will be proportional to (x 2 — Al'-^) -1 or 
( X 2 _ M^)- 1 or (eag/oW)- 1 . 

The construction will be performed in such a way that the matrices (x 2 — M. [- p l ) will be defined 
by series which will be proved to be convergent; furthermore if we only considered the contributions 
to the formal representation of h coming from trees in which no propagator carries the "last label" 
[> n] then the corresponding series would be convergent. 

We express the latter property by saying that the performed resummations regularize the formal 
representation of h down to scale [n — 1], or that the propagators singularities are probed down 
to scale [n — 1]. The problem of course remains to understand the contributions from the trees 
containing lines with label [> n\. The construction will be such that their propagators are also 
properly defined because the matrices M\- n ^ will always be well defined by convergent series (as 
we shall see). However for the lines whose label is [> n] no useful positive lower bound, not even 
n-dependent, can be given on the smallest eigenvalue of the denominators in the corresponding 
propagators. 

We shall say that the lines with scale [> n] probe the singularity all the way down to the 
smallest frequencies or all the way down in the infrared scales. Thus in spite of the convergence of 
the contributions to h coming from trees with labels [oo], [0], [1], . . . , [n — 1] the representation of 
h remains formal. 

Therefore we shall proceed by increasing the value of n trying to take the limit n — ► oo. This is the 
procedure followed in the case of the theory of hyperbolic tori in Ref. [GG]. In that case, however, 
the propagators denominators (x 2 — A^-™') had eigenvalues always bounded below proportionally 
to x 2 . Indeed the last s eigenvalues of M\- n ^ were negative whereas the first r remained close to 
zero within 0(ex 2 ) (a non-trivial property, however, due to remarkable cancellations well known 
in the KAM theory, [Ga]). 

Here the matrices x 2 — M. I- n l will be shown to have the first r eigenvalues differing by a factor 
(1 + 0(e 2 )) and the last s differing by 0{e 2 ) with respect to those of x 2 — Mq (which has by 
construction r eigenvalues x 2 and s eigenvalues x 2 — eaj j = 1, . . . , s). Thus the denominators can 
become small because either x 2 gets close to or because it gets close to ea\, . . . ea s . Therefore 
the regularization will have to be split in two parts. The first part will concern regularizing the 
scales [p] with p such that the eigenvalues of x 2 — M. remains bounded below proportionally to 
x 2 ; we shall call this part of the analysis the high frequencies resummation. The other part, which 
we shall call the infrared resummation, will concern the regularization of the scales [p] , in which x 
can be so close to some eaj that the denominators cannot be bounded below proportionally to x 2 . 

We associate with each momentum u the frequency x = u) ■ u and we measure the strength of 
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this resonance by the integer p if D(x; e) ~ Cq2 2p , with 



D(a;;e) = min a; 2 - Af ] (e) 



def 



Therefore the condition that the resonance strength of the frequency x be bounded below propor- 
tionally to \x\ is that p is not too large compared to no defined in (3.2), so that x 2 stays away from 
the corresponding eigenvalue A^°' (e) by more than a small fraction of the minimum separation 5 
between the distinct eigenvalues. For instance we can require D(x;e) > 2- 2 (" 0+1 )C 2 > (5/4. This 
gives p < n , with 

n ~n + n 7 n d = — 1 + ^ log 2 -, p = -tcl^ 1 min{ai, min{a J+ i — a,}}. (4.2) 

2 p 4 j 

In fact the requirement could be fulfilled with n one unit larger: the interest of using the above 
value of n will emerge later (if s = 1 one interprets p = \). 

We then perform the analysis by defining recursively the matrices M\- P \x;e) for p = 0, . . . , no 
with eigenvalues A^(a;, s) verifying for a suitable constant 7 > 

\\f(x,e)-\f{e)\ < 7 e 2 , P < n , (4.3) 
so that if the label p of the line with frequency x is p < n then one has, if ia s 2- 2 (" +1 ) - 7 e > 0, 

\x 2 - \f{x,e)\ > \D{x,e) + l -D{x,e) - 7 £ 2 > \d{x,e) > 2- 2 ^\x\ 2 (4.4) 
where the last step is obvious if \x\ 2 > 2A^'(e), otherwise it follows from the inequality 

D(x-e) > 2- 2 ^ +1 h- 2no C 2 > 2- 2 (" +1 )A™( £ ) > 2- 2 ("+ 1 )- 1 |a;| 2 - (4-5) 

We can say that for p < no the strength of the singularity is dominated by the distance |x| to the 
origin, i.e. by the "classical" small divisors x~ 2 provided, of course, the matrices x 2 — M\- P \x]e) 
remain close enough to x 2 — Mq (which we shall check) . Furthermore the convergence of the sum 
of all values of trees with no line label [> no] will be performed exactly along the lines of Ref. 
[GG] because the bound (4.4) guarantees that in evaluating such trees one does not probe the 
singularities close to the eigenvalues of M . 

The departure from the method in Ref. [GG] occurs when we consider trees in which lines bear 
the label [> no]. The problem will again be studied by a multiscale analysis which will have to 
be suitably modified to allow probing the new singularities arising from the resonances between 

the frequencies x and the ^/a' ' (e), j > r. The difficulty is that the propagator g^™ ' will not be 

singular exactly at the frequencies \J~^\s) 7^ but at the frequencies fixed by the roots of the 

eigenvalues a'~™ ' (x; e) of the matrices M\- n °\ (x; e). The latter not only are slightly different from 
those of M but will turn out to depend also on x. 

This means that D(x;e) and even A(x;e) = min^ | x 2 — X^ n °\x; e)| no longer provide a good 
estimate of the strength of the singularity, because D, A vanish at the "wrong places". In fact we 
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shall have to perform a multiscale analysis to resolve the infrared singularities, and it will happen 
that at each of the new scales with labels [p], with p > no, the singularities will keep moving. 

Suppose to have regularized the series up to scale [n — 1], with n > no, introducing suitably 
matrices M^- P \x; e), with p = n a ,...,n, thus pushing the probe of the singularities down to 
scales Co2~"; then to avoid meaningless expressions we shall have to impose on the eigenvalues 
of the last propagator, proportional to (x 2 — M\- n \x; e)) _1 , a condition like (3.3). Since the 
eigenvalues depend on n and x this risks to imply that we have to discard too many e's; in the 
limit n — ► oo: when, finally, the singularities will have been probed on all scales, or even for large 
enough scales, we might be left with an empty set of e's rather than with a set of almost full 
measure. 

Physically the difficulty shows up because of the possibility of resonances between the proper 
frequencies of the quasi-periodic motion on the tori and the normal frequencies. It will be studied 
and solved in Section 6 below, while in Section 5 we shall discuss the simpler regularization of the 
series for h on the high frequency scales. 

The spirit informing the analysis is very close to the techniques used in harmonic analysis, in 
quantum field theory and in statistical mechanics, known as "renormalization group methods" (see 
Rcfs. [Fe], [GM1], [GaOl] and [Ga02]). The methods are also based on a "multiscale decomposition" 
of the propagators singularities. We introduced and adopt the above terminology because we feel 
that it is suggestive and provides useful intuition at least to the readers who have some acquaintance 
with the renormalization group approach and multiscale analysis. 



5. Non-resonant resummations 

The resummations will be defined via trees with no trivial nodes and with lines bearing further 
labels. Moreover the definition of propagator will be changed, hence the values of the trees will be 
different from the ones in Section 3: they are constructed recursively. 



Xo 



Xo 



/A 



C 2 /4 



D 



C 2 /i 



D 



C 2 /4 2 C 2 /4 



D 



Figure 2. The first graph is i/>o, the second is xo and the third is Xo = V'lXO- 



Instead of the sharp multiscale decomposition considered in Ref. [GG] here it will be convenient 
to work with a smooth one as in Ref. [Ge]. Let i(>(D) be a C°° non-decreasing compact support 
function defined for D > 0, see Fig. 2, such that 

ip(D) = 0, for D < C 2 /4, 



1>(D) = 1, 



for D>C 2 , 



where Co is the Diophantine constant in (1.3), and let x{D) 



(5.1) 

1 - V(-D). Define also i/> n (D) = 



iP(2 2n D) and Xn(D) = x(2 2 "L>) for all n > 0. Hence tp = tp, \o = X and 
1 = 4> n (A(x;e)) +x«(A(x;e)), for all n > 0, 



(5.2) 



for all choices of the function A(x;e) > 0: in particular for A(x,e) — D(x) with D(x) defined in 
(5.3) below. We set the following notations. 



Definition 1. Letn ,n be as in (4-2) and D(x;e) as in (4.1). 
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(i) Divide the interval Ic = [e m in, 4e m i n ], where e varies, see (3.2), into a finite number of small 
intervals I of size smaller than \e m \ n p, see (4-2), i.e. smaller than a fraction of the minimum 
separation between the eigenvalues 0, ai, . . . , a s . Define 



D(x; I) = mm D(x; e) = min min x 2 — A^- (e) 



mm 



(5.3) 



i in it x 2 — y ^ ' ' 

eel " ' ' eel j 

where j(x) is the smallest value of j for which the last equality holds: exceptionally there might be 
2 such labels. The j(x) is e -independent, by construction, for e € I. 



Remarks. (1) Note that, as a consequence of the definition of the intervals / and of D(x;I) as 
given by (5.3), one has, for all eel, 



mm 

j 



-Af (£ ) 



> 



1 



(5.4) 



(2) If e is in one of the intervals / and x verifies D(x; I) < Cq2 _2 ™° then there is only one value of 
j for which last equality in (5.3) holds. 

(3) We shall fix, from now on, e in one of the intervals I C I c . Remark that D(x; I) is piecewise 
linear in x 2 with slope equal to 1 in absolute value for x in the regions where it will be considered 
(see below) and we simplify the notation by setting 



D{x) d = f D{x;I). 



(5.5) 



(4) The number of intervals I C Ic can and will be taken independent of £ m i n , i.e. of the interval 
Ic where e varies, and equal to a fixed integer < 6 / o~ 1 . 

(5) From now on we only consider trees with no trivial nodes. 

A simple way to represent the value of a tree as sum of many terms is to make use of the identity 
in (5.2). The propagator g(x; e) = g'- ' (x; e) = (x 2 — Mo) -1 of each line with non-zero momentum 
(hence with x ^ 0) is written as 

g^°\x; e) = M^W) 9 l -" ] {x; e) + Xo(D(x)) g^ (x; e) = f g® (x; e) + g^ 1 } (x; e), (5.6) 

and we note that g^(x; e) vanishes if D{x) is smaller than (Co/2) 2 , see Fig. 2. 

If we replace each g^-°\x; e) with the sum in (5.6) then the value of each tree of order k is split 
as a sum of up to 2 k terms 2 which can be identified by affixing on each line with momentum v ^ 
a label [0] or { > l}. Further splittings of the tree values can be achieved as follows. 

Definition 2. For p = 1, ...,nb, let A4^(x;e) be matrices with eigenvalues X^(x;s), p = 
1 , . . . , n; we set M [0] (x; e) = M and M [ ^ n] {x; e) = Y^ =0 M [p] {x;e). Define forO <n<n -l 

Jn] ,_. ^ de_f MP jx)) Iirin Xm(D(x)) 

9 K ' ' x 2 -M^(x;e) 

g{ >_, ](x;£)V up0mL r ,, 7) 

„[>n] r _. _A def U^n=oXm(D(x)) 

9 K : ' x 2 -M[<™](x; £ )' 



Not necessarily 2 k because there might be lines on scale [oo] whose propagator is not decomposed. 
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and g^(x;e) = ipo(D(x)) (x 2 — M ) _1 . We call the labels [n], {> n}, [> n] scale labels. 

Remarks. (1) The products []" „ \ m (D(x)) can be simplified to involve only the last factor: we 
keep the notation above as it is a notation that naturally reflects the construction. The propagators 

play a subsidiary role and are here for later reference. 

(2) The matrices M.^{x;e) will be defined recursively under the requirement that the functions h 
defining the parametric equations of the invariant torus will be expressed in terms of trees whose 
lines carry scale labels indicating that their values are computed with the propagators in (5.7). 

(3) Note that if we defined M^(x;e) = M , i.e. M [p] = forp > 0, then (recall that we consider 
only trees without trivial nodes) we would naturally decompose (see below for details) the tree 
values into sums of many terms keeping obviously each total sum constant by repeatedly using 
(5.2), thus meeting the requirement in Remark (2) above. This would be of no interest. Therefore 
we shall try to define the matrices Ai^(x;e) so that the sum of the values of new trees (with no 
trivial nodes and whose nodes and lines i still carry harmonic and momentum labels as well as 
scale labels [oo], [0], . . . , [n— 1], [> n]) remain the same provided their values are evaluated by using 
the propagators in (5.7) and we shall try to define M\- p]> (x; e), so that there is also control of the 
convergence. 

(4) In other words we try to obtain a graphical representation of h, involving values of trees which 
are easier to study at the price of needing more involved propagators. This is a typical method 
employed in KAM theory [GBG], and in other fields. 

To define recursively the matrices we introduce the notions of clusters and of self-energy clusters 
of a tree whose lines and nodes carry the same labels introduced so far and in addition each line 
carries a scale label which can be either [oo], if the momentum of the line is zero, or [p], with 
p = 0, . . . ,no — 1, or [> no]. Given a tree 6 decorated in this way we give the following definition, 
for n < no- 

Definition 3. (Clusters) 

(1) A cluster T on scale [n], with < n, is a maximal set of nodes and lines connecting them with 
propagators of scales [p], p < n, one of which, at least, of scale exactly [n]. We denote with V(T) 
and A(T) the set of nodes and the set of lines, respectively, contained in T. The number of nodes 
in T will define the order of T , and it will be denoted with kr- 

(ii) The nix > lines entering the cluster T and the possible line coming out of it (unique if 

existing at all) are called the external lines of the cluster T. 

(in) Given a cluster T on scale [n], we shall call ut — n its scale. 

Remarks. (1) For instance if n = the scale of the lines in the cluster can only be [0]. 

(2) Here n < no — 1. However the definition above is given in such a way that it will extend 
unchanged when also scales larger than no will be introduced. 

(3) The clusters of a tree can be regarded as sets of lines hierarchically ordered by inclusion and 
have hierarchically ordered scales. 

(4) A cluster T is not a tree (in our sense); however we can uniquely associate a tree with it by 
adding the entering and the exiting lines and by imagining that the lower extreme of the exiting 
line is the root and that the highest extremes of the entering lines are nodes carrying a harmonic 
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label equal to the momentum flowing into them. 
Definition 4. (Self-energy clusters) 

(i) We call self-energy cluster of a tree 9 any cluster T of scale [n] such that T has only one 
entering line and one exiting line l\, and furthermore X)dgv(t) = 0- 

(ii) The degree of a self-energy cluster is the number of nodes. 

Remark. The essential property of a self-energy cluster is that it has necessarily just one entering 
line and one exiting line, and both have equal momentum (because X)oev(T) u v = 0)- Note that 
both scales of the external lines of a self-energy cluster T are strictly larger than the scale of T as 
a cluster, but they can be different from each other by at most one unit. Furthermore the degree 
of a self-energy cluster is > 2. Of course no self-energy cluster can contain any line on scale [oo]. 

Definition 5. (Self-energy matrices) 

(i) Let ©j^„ 7 be the set of trees of order k with root line momentum v and root label 7 which 
contain neither self-energy clusters nor trivial nodes. Such trees will be called renormalized trees. 

(ii) We denote with S^ n the set of self- energy clusters of order k and scale [n] which do not contain 
any other self- energy cluster nor any trivial node; we call them renormalized self-energy clusters 
on scale n. 

(Hi) Given a self-energy cluster T e S]} n we shall define the self-energy value of T as the matrix 3 



^•^)=t^( n ^ ] )( n (5 - 8) 



where gf 1 ^ = g[™ f l(u; ■ vf,e). Note that, necessarily, m < n. The kr — 1 lines of the self-energy 
cluster T will be imagined as distinct and to carry a number label ranging in {1, . . . , kr — 1}. 

The recursive definition of the matrices A4^ n \x;e), n > 1, will be (if the series converges) 

n — 1 oc n — 1 

MW(x;e)=(Hx P (D(x)))j2 E *) = ( II X p (D(x)))m^(x; s), (5.9) 

where the self-energy values are evaluated by means of the propagators on scales [p], with p = 
0, . . . , n, which makes sense because we have already defined the propagators on scale [0] and the 
matrices M [0] (x;e) = M (cf. Definition 2). 

With the above new definitions we have the formal identities 

00 

where we have redefined the value of a tree 9 G as 

vai ^=^( n 9 [m] {o>-vt\ej)( n f 0' (5 - n) 

£eA(e) vev(e) 



3 This is a matrix because the self-energy cluster inherits the labels 7, 7' attached to the endnode of the entering 
line and to the initial node of the exiting line. 
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with [r]e\ = [oo], [0], . . . , [no — 1], [> no]- Note that (5.10) is not a power series in e. 

The statement in (5.10) requires some thought, but it turns out to be a tautology, see also Rcf. 
[GG], and Ch. VIII in Ref. [GBG], if one neglects convergence problems which, however, will 
occupy us in the rest of this paper. A sketch of the argument is as follows. 

Imagine that we have only scales [oo], [0], . . . , [n — 1], [> n], i.e. we have performed the scale 
decomposition of the propagators up to scale [n — 1] and we have not decomposed the propagators 
on scale [> n] and that we have checked the statement (5.9) and (5.10) (trivially true for n = 0). 

Given a tree 9 € 7 with lines carrying labels [p] with p = 0, . . . , n — 1 or [> n] or [oo], 

we can split the propagators g^- n \x;s) as g^(x;e) + gi- n+1 } (x; e) as in (5.6) with g^(x;e) = 

i/) n (D(x))g[- n ](x;e) and g{- n+1 } (x; e) = Xn(D(x))g^- n ^(x;e). In this way we get new trees which 
in general contain self-energy clusters of scale [n]. We can in fact construct infinitely many trees 
with self-energy clusters of scale [n] simply by inserting an arbitrary number of them on any line 
i with scale {> n + 1}. 

The values of the trees obtained by k > such self-energy insertions on a given line of a tree in 
&k,v 7 can b e arranged into a geometric progression: in fact they differ only by a factor equal to 

the value of the integer power k in g{^ n+1 ) (x; e) (M^ 1 ! (x; e)gi- n+1 } (x; e)) k+1 if M^+V (x; e) 
is defined as in (5.9), where the Vt{x]s) are evaluated by using as propagators g^(x;e), with 
0<p<noTp~oo, for the lines carrying a scale label [p\. Summation over k will simply change 

g{- n+1 } (x; e) into g[- Tl + 1 l (x; e) and at the same time one shall have to consider only trees with 
no self-energy cluster of scale [n] nor of scale [p] with p < n and with lines carrying scale labels 
[oo], . . . , [n] or [> n + 1]. In this way we prove (5.10) for all n < no : we could continue, but for 
the reasons outlined in Section 4, we decide to stop the resummations at this scale. 

In other words the above is a generalization of the simple rcsummation considered in Section 
3. The result is still as formal as the Lindstedt series we started with even assuming convergence 
of the series in (5.9). In fact the consequent expression for h cannot even be, if taken literally, 
correct because as in Section 3 the denominators in the new expressions could even vanish because 
no lower cut-off operates on the lines with scale [> n ] as the third of (5.7) shows. 

To proceed we must first check that the series (5.9) defining M^(x;e) are really convergent. 
In spite of the last comment this will be true because in the evaluation of M^(x;e) the only 
propagators needed have scales [p] with p < n — 1 so that, see the factors ip n (D(x)), Xn{D(x)) in 
(5.7), their denominators not only do not vanish but have controlled sizes that can be bounded 
below proportionally to x 2 by (4.4), i.e. simply by a constant times Cq\v\~ 2t ° , see (1.3). 

In Rcf. [GG] it has been shown by a purely algebraic symmetry argument that, as long as one can 
prove convergence of the series in (5.9), the matrices M^(x;e) are Hermitian and (Af'"' (x; e)) T = 
M[™](— x; e). Furthermore we should expect that the eigenvalues of the matrix M\- n \x] e) should 
be approximately located either near or near eoi, . . . ,ea s at least within 0(e 2 ). 

The expectation relies on Ref. [GG] (see Eq. (3.25)) where the following " cancellations result" 
is derived for no large enough (hence for e small because 2~ 2 "°~ 2 < ea s < 2~ 2 ™°Co): we reproduce 
the proof in Appendix A3 below, adapting it to the present notations. 

Lemma 2. If n is large enough and n < n = n + n (see (4-2)) then the following properties 
hold. 



1/ febbraio/2008; 17:36 



15 



16: Degenerate elliptic resonances 



The matrices M\- n \x;e), x = u> ■ v, are Hermitian and can be written as 

'M [ £ n] (x;e) M% n] (x;e) 
s Mf™\x;e) Mf n] (x;s) 



where the labels a run over {1, . . . , r} and (3 over {r + 1, . . . , s}. 

(ii) One has M-^(x;e) = (M. x; e)) T , so that the eigenvalues of M\- n \x;e) verify the 

symmetry property X^\x;e) — \^\—x,s), i.e. they are functions of x 2 . 4 
(in) Let d^r be right and left x- derivatives, then 

\\M [n] {x,e)\\ < Be 2 e- K ^ n/T , M l ^ n] (x , e)|| < Be 2 a^ 2 , \\d^M^(x,e)\\ < Be, 
\\M^ ] a {x;e)\\<Be-^ /T min{ £ 2 , ex^ 1 }, 

\\M l :l(x;e)\\<Be-^ 2n/T min{ £ 2 , \x\ a, 1 / 2 }, (5.13) 
\\M^(x;s)\\<Be-^ n/T e 2 , 

for n < no and /or suitable no-independent constants B,k\,t > 0; one can take t = tq. 

While Ki is dimcnsionless the constants A',A,B have same dimension (of a frequency square): 
this is the purpose of introducing appropriate powers of a s . 
General properties of matrices and (5.13) imply, see Appendix A4, 

A' < \d e X [ p ] (x;e)\ < A, a] \d± \f {x; e)\ < As 2 , j > r, 



(5.14) 



A'< \d £ (\f\x;e)-\t l] (x;e))\, i ? j > r, 

\\f ] (x;e) X l ;- 1] (x;e)\ < e 2 Be-^'\ j > r, 

\X^(x;e)\ < A min{e 2 , ex 2 a~ 1 }, j < r, 

where A', A > arc n, no-independent constants, and t — to. 

Remarks. (1) The first three bounds on the eigenvalues in (5.14), follow from the first line of (5.13) 
by using the self-adjointness of the matrices M\~ n \x;e); see Appendix A4. The other bounds in 
(5.13) imply the last bound in (5.14); see Appendix A4. 

(2) The natural domain of definition in x of M\ n \x,s), n > 0, will turn out to be D(x) < 
2-2(n-i) (j2 ^ we imagine that it is defined for all x by continuing it as a constant from its limit 
value. In fact this is not important because, as we shall see, only the values of M\ n \x, s) with 
D{x) < 2 -2 ("~ 1 )Co enter into the analysis. Smoothness means differentiability in e e Ic and a 
right and left differentiability in x. The lack of differentiability in x, but the existence of right 
and left x derivatives, is due to the fact that the function D(x) admits right and left derivatives: 
hence lack of differentiability in x appears as an artifact of the method. This lack of smoothness 
(unpleasant but inessential for our purposes) can be eliminated by changing D(x) into a new D{x) 
which is smooth for x 2 between successive A -(e)'s and, at the same time, it is bounded above and 



s = 2 and f(a,(3) = fo{0) + /i(/9)cosai + /2(/3)cosQ2, to lowest order in x,e, one has 
M l a i n] (x;e) = 3 £ 2 x 2 (2^)-i [/ 2( /3 ) + |9 /J / 1I ( / 3)|2] < 5 11 ,„, M [ Jf ] = i £ 2 x(2^)-i 9/3 J( / 2( / 3) + | 9(3 ^( /3 )|2)] i andM^" 1 = 
edlfoiP), u,v = l,2. 
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below proportionally to D(x). But this would make the discussion needlessly notationally involved 
and we avoid it. 

(3) One should also remark that, although we excluded some values of e (i.e. we required £ G £n -i, 
see (3.3)), here all e € Ic are allowed. The restriction on e plays no role in the high frequency 
resummations: so far its only purpose is to avoid divisions by and to assign a finite value to 
contributions to h from trees with with propagators on scale [> n ] (which could be infinite 
because of the lack of an infrared cut-off in their expressions; see the third of (5.7)). 

(4) The bounds on the entries of M^(x;e) in the second and third lines of (5.13) arise from 
cancellations that are checked in Ref. [GG] via a sequence of algebraic identities on the Lindstedt 
series coefficients and the real difficulty lies in the proof of convergence. The algebraic mechanism 
for the cancellations is briefly recalled in Appendix A3, for completeness. 

(5) Loosely speaking (as mentioned in Section 4) the reason why the above result holds with no- 
independent constants, and why its proof can be taken from Ref. [GG], is that if the scales of the 
propagators are constrained to be [p] with p < no the propagators denominators can be estimated 
by 2- 2 (™ +1 )- 2 a; 2 by (4.4) and by the Remark (1) after Definition 1. This means that one can 
proceed as in the hyperbolic tori cases in which boundedness, from below, proportionally to x 2 of 
the propagators denominators was the main feature exploited and no restriction on e had to be 
required, other than suitable smallness. 

The lemma can be proved by imitating the convergence proof of the KAM theorem, see for 
instance Ref. [GG]; however in the following Appendix A3 the part of the proof which is not 
reducible to a purely algebraic check is repeated, for completeness. 

We have therefore constructed a new representation of the formal series for the function h of 
the parametric equations for the invariant torus: in it only trees with lines carrying a scale label 
[oo], [0], . . . , [no — 1] or [> n ] and no self-energy clusters are present. The above lemma will be 
the starting block of the construction that follows. 

6. Renormalization: the infrared resummation 

Convergence problems still arise from the propagators g^- n °\x; e), which become uncontrollably 
large for x = uj ■ v close to the eigenvalues of M because the bound (4.4) which allowed control of 
the divisors in terms of the classical small divisors (i.e. in terms of |a;|) does not hold any more. 
Hence we must change strategy. 

Definition 6. Given d x d Hermitian matrices M\- n ^{x; e), n — no, no + !,••■, with eigenvalues 
}^\x]e), we introduce the following notations. 

(i) The sequence of self-energies A^™'(e) is defined for n > uq by 

4 B] ( £ ) = X f GAPS), e) , Af -I] ( e ) m X f , (6.1) 

provided A^"'(e) > ; n = n ,n + 1, . . .. 

(ii) The propagator divisors are defined for n > n by 

(6.2) 



AW(x;e) 



def 
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where j(x) is the label where the minimum of x — A'- '(e) 



is reached. 



Remarks. (1) The self-energies are denned recursively starting from those of the matrix M whose 
first r eigenvalues are 0. Hence, as long as one can extend the last of (5.14) and as long as the 
self-energies A^"'(e) remain close to the original value \^\ as we shall check for e small enough, 
one has A'™' (e) = for j = 1, . . . , r and A.'"' (e) > for j > r. 

(2) Under the same conditions and if A n \x;e) ~ 2~ 2 ™Cq the label j(x) depends only on M , 
hence it is n- independent, and furthermore it is constant at x fixed, as e varies in the intervals / 
introduced in Definition 1 (because for large n the frequency x is constrained to be close to one of 
the \f ] (e)). 

(3) The name of propagator divisor assigned to A^ n \x,s) in (6.2) reflects its later use as a lower 
bound on the denominator of a propagator, see Remark (7) to the inductive assumption below. 

By repeating the analysis of Section 4 we can represent the function h via sums of values of trees 
in which lines can carry scale labels [oo], [0], . . . , [n — 1], [no], [n + 1], . . . and which contain no 
self-energy clusters and no trivial nodes (i.e. are renormalized trees, see Definition 5 in Section 
5). The new propagators will be defined by the same procedure used to eliminate the self-energy 
clusters of scales [n] with n < no — 1 . However we shall now determine the scale of a line in terms 
of the recursively defined A^ n \x; e) rather than in terms of D(x): the latter becomes too rough to 
resolve the separation between the eigenvalues and their variations. 

Let X^ - 1 {x)= / ICZo Xm(D(x)), Y n (x;e)= f YT m= - o Xm (AH( x; e )) for n > n and Y~n a -\ = 
1: the definition of the new propagators will be 



glno] %f Xji0 _ 1 (x)^ (A^(x-,6))(x 2 -M^(x;s))-\ 
glno+i]^ X 7ro _ 1 (x) XTro (A["»](x; £ ))^ 0+1 (A["»+ 1 ](x; £ ))(x 2 -M^+ 1 ]( a; ; e ))- 1 , 



(6.3) 



def 



X li0 _ 1 {x)Y n _ 1 {x;e)i )n {A^\x ] e)){x 2 - M^{x;e))-\ 



and so on, using indefinitely the identity 1 = ip n (A^ (x; e)) + x„(Al"l (x; e)) to generate the new 
propagators. 

In this way we obtain a formal representation of h as a sum of tree values in which only renor- 
malized trees (i.e. with neither trivial nodes nor self-energy lines, see Definition 4 in Section 4) 
and in which each line I carries a scale label [ni\. This means that we can formally write h as in 
(5.10), with Val(0) defined according to (5.11), but now the scale label [ne\ is such that ni can 
assume all integer values > or oo, and no line carries a scale label like [> n]: only scale labels 
like [n] are possible. The corresponding propagators g^ n ^{u ■ ff,e) will be defined as follows. 

Definition 7. Given a sequence M^-- m \x;e) as in Definition 6, m > 1, let M^(x;e) = 
M^ n \x;e) -M^ n -^(x;e) with = = M so that M^ n \x; e) = J2l=o M[m] ( x '> £ )- 

Setting A^(x;e) = D(x) if n < no, define for all n > 

[n]( , M^ [n] (^e))UZ> X m (A^(x;e)) 
x z — M^- n '(x;e) 
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(for n = this means tp (D(x)) (x 2 — Mq)^ 1 ). We say that gjp = (u • Vf, e) is a propagator with 
scale [n]. The matrices M\ m \x;e) will be defined as in Section 5 for n < no and will be defined 
recursively also for n > no in terms of the self-energy clusters S]} n _ 1 introduced in Definition 4, 
Section 5, setting for n > no (see (5.9)) 

n oo 

M^(x;e) = ( [] X m (A [m W)))E £ V T (x;e), (6.5) 

m=0 k=2 TeS? _j 

where the self-energy values Vt(x;s) are evaluated by means of propagators on scales less than [n]. 
Note that we have already defined (consistently with (6.5))) the matrices M\- n ^ with n < n and 
the propagators on scale [oo], [0], . . . , [no — 1] (so that (6.4) defines also g^ no \x;e)). 

Remark. (1) Some propagators may vanish being proportional to a product of cut-off functions. If 
a propagator of a line has scale [n] and does not vanish then, see (6.4), 

2 -2(„+l) c ,2 < A N( x;£ ) < 2 -2{n-l) G 2 ^ 



(2) Our definitions of the matrices M\- n \x;e) for n > no will be such that given the node 
harmonics of a tree hence by (6.6) the scale [n] that is attributed to a line can only assume up to 
two consecutive values unless the propagator (hence the tree value) vanishes. 

(3) We may and shall imagine that scale labels are assigned arbitrarily to each line of a given tree 
with the constraint that no self energy clusters are generated; however the tree will have a non-zero 
value only if the scale labels are such that all propagators do not vanish. This means that only 
up to two consecutive scale labels can be assigned to each line if the tree value is not zero. The 
"ambiguity" on the scale labels for a line is due to the use of the non-sharp \ and if functions of 
Fig. 2. 

We make an inductive assumption on the propagators on the scales [to], < to < n. 

Inductive ASSUMPTION. Let no = n + n (see (4-2)) and suppose n large enough; then 

(i) For < m < n — 1 the matrices Ai (x; e) are defined by convergent series for all e G Ic and, 
for all x, they are Hermitian, and A4^ m \x;e) = (A4^ (— x, e)) T . Furthermore they satisfy the 
same relations as (5.13), hence (5.14), with n replaced by m, for all < m < n — 1, with suitably 
chosen (new, possibly different) constants K\, A, A', B, t. One can take r = 2t\. 

(ii) There exist K > and open sets m — 0, ...,n, with £^ C Ic, such that, defining 
recursively A' m '(e) in terms o/Aj™ ^(e) for m = no, ■ ■ ■ ,n — 1 by (i) in Definition 6 above, while 

setting A^ m '(e) = A^°' for m = 0, . . . , no — 1, and defining n d — t + r + 1, see (3.4), one has for 
r' m ' (x; e) = min I min 

I- 3 

\E° m \ < K2-i m C 2 , 
for all m < n — 1 and all x. 

Remarks. Assuming validity of the hypothesis for m < n we note a few of its implications. 

(1) So far we have only checked the hypothesis for scales [m] with m < no, as expressed by Lemma 



mm 



x± 



Af ] (e)± 



Al m] ( £ ) }> 



2-2' 



u 



(R 7\ 
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2 in Section 5, i.e. for the high frequency propagators. If (i) is proved also for m = n then we can 
impose (6.7) immediately by excluding a set £^ of e's of measure estimated by 2~^ m C 2 K with 
K a constant that can be bounded in terms of A 1 , A by introducing the constants p m and p' m as 
in (A2.1), with a'°'( £ ) replaced by A^'fe), and proceeding as done in Appendix A2 for the case 
n <n . Note that since the self-energies X^^e) are = Af'(e) for all m = 0, . . . ,n"o — 1 one will 
have, for such m's, = Ic/£n -i, see (3.3). It is very important to keep in mind, in the above 
argument, that the self-energies either are (for j < r) or are close within 0(e 2 ) to the positive 
eigenvalues of M , and they are differentiable in e and to the right and left of each x by (i); see 
(5.14). 

(2) If a line with a scale [n] has vanishing propagator (i.e. g^ (x; e) = because of the \i ip cut-off 
functions in the definition (6.4)) but n differs at most by one unit from the integer n! such that 
g^ n \x;e) ^ 0. Thus if we consider A^(x,e) we can bound it by changing the inequalities (6.6) 
into Cl 2- 2 ("+ 2 ) < AW(x;s) < Cl 2- 2 (™- 2 ). The remark will be useful later when we shall exploit 
it in the discussion of the cancellations which we shall study to check the inductive hypothesis. 

(3) By (5.13) and (5.14), and (I) in Appendix A4, we deduce that A' m '(x;e), hence A' m '(e), do not 

change by more than C B s 2 J2 n >n e~ Kl2 " /<2Tl) , with respect to A'°'(e), if s < ~E\ (and m > n ). 

(4) Hence if e is small enough the self-energies, i.e. A^ m '(e), have distance bounded above by 2a s e 
and below by iemin {ai, mhijjaj+i — cii}} = 2pea s with p defined in (4.2), if e is small enough, 
say e < e 2 • 

(5) Therefore by Remark (4) we see that the distance of \x\ 2 from the closest value A^™' (e) is smaller 

than one fourth, up to corrections 0(e 2 ), the distance between the distinct values of A^ m '(e), if m 
is large enough compared to no, i.e. if 2Cq 2 _2m < p£a s (or m — n >n as implied by the definition 
(4.2) of n). This means that j(x) is to, e- independent and it coincides with the label minimizing 
\x 2 — |A' m '(a;;e)|| for all to > n and all £ £ I. 



(e) = a'°' are x-independent and, by their definition, the same remains true for all 



(6) Af " 11 

A^™' (e) . The self-energy A^ m ' (e) will be thought of as a reference position for the j-th eigenvalue 
on scale [to], to < n — 1. 

(7) As noted in Remark (5) the quantity \x 2 — a'™^ (a;; e) \ is the smallest denominator appearing in 
the value of the propagator of a line with momentum u if (x; e) ^ (here x = u> ■ u). The key to 
the analysis is the check that the quantities Al n l(a;;e) can be used to bound below the denominators 
of the non-vanishing propagators of scale [n]. If \^)Ax\£) < one has x 2 — A^L(a;;e) > x 2 , so 

that the assertion is trivially satisfied: therefore the really interesting case is when a'™^ (x; e) > 
(which includes the cases j(x) > r). li x has scale [n] with n>n one has 

n - v^£M ^ |n - y/HM ~ |\AS)( £ )- 





1 




> 




\x 




2 




1 




> 




\x 




2 



(6.8) 



\x 2 -M [n] (x, £ )\\ > 




having used the lower cut-off ip n (A^ (x; e)) in the propagator (see (6.3)) to obtain the first two 
terms in the second line (and added a further factor 2 _1 in order to extend the result also to the 
propagators considered in Remark (3)), while the upper cut-off x n _i(A[ n_1 l (x; e)) has been used 
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to obtain positivity of the difference between the second and third terms in the second line, after 
applying (5.14), for n > n , to get 

maxl^A^^e)! < Be 2 , j(x) > r, \\ [ $ x) (x; e)\ < Be\x\ 2 < eC 2 2- 2n , j(x) < r, (6.9) 

for some B, so that the last term in the second line of (6.8) can be bounded above proportionally 
to £2~™C . Hence the first inequality in the last line of (6.8) follows if e small enough, say 
£ < £3 for some £3, fixed independently of n. The latter constraint can be achieved simply by 
taking n large enough, sec (3.2). The last implication follows from (6.9) if j(x) < r because 

-j(>)( e ) = °" 0tncrwisc if j( x ) > r and \ x l \ZAj"i)(e), \J^ x ){x-,e) S [^y/eai, 2^/aJe] one has 
(kl + \J ^ x) (.x,e))/(\x\ + \J>^ X -) (e)) > 2 _2 v /ai/a s , as long as e < e 2 (see Remark (4) above): 
implying again the (6.8). Hence At"l(a:;e) can be effectively used to estimate the size of the non- 
vanishing propagators which is, therefore, closely related to the scale of the corresponding lines. 
(8) The Diophantine condition (3.3) and (6.7) will play from now on a key role. We begin by 
remarking that if the inductive hypothesis is proved all lines will eventually acquire a well defined 
scale label: in fact fixed x one cannot have A^(x,s) < 2~ 2 ™Cq for all n because this implies 5 

\\x\ — ^/a'™^(£)| < 2~"Cb , which soon or later becomes incompatible with the first of (6.7). This 
explains why there is no trace left of the propagators g^- n \x, e). 

To estimate the corrections to the self-energy as n increases it is clear that we must estimate the 
size of MS n }(x; e). For this purpose we need the following result. 

Lemma 3. There is s small and constants k\,A,A',B such that if e < e and the inductive 
hypothesis is assumed for < m < n — 1 then the matrix M\ n \x;e) can be bounded by (5.13) and 
the inductive hypothesis holds for m = n. 

Hence the hypothesis holds for all n since we have already checked it for n = 0, . . . , no (Lemma 
2): the new constants K\, A, A' , B will be different from the ones determined in Lemma 2. 

Proof. For n < no the bound (5.13) is covered by Lemma 2. So we can assume n > no + 1. 
Suppose first e G n^ ( l 1 - o _ 1 £ m , with £ m = I c \ so that the Diophantine property (6.7) holds 
for all m < n — 1. Consider a self-energy cluster T in Ugl 2 <S^ n _ 1 . If the entering and exiting lines 
(with propagators of scale [> n]) have momenta v we begin by showing that 

Wx>\>2 {n - 6)/{2Tl) - (6.10) 

oev(T) 

Indeed the cluster contains at least one line i — l<o with propagator which we can suppose to be 
not vanishing and which has scale [n — 1]. We can write in = + o-gv, where erg = 0, 1 and we 
set (jj ■ v = x, i/9 = mev(T) v m j an d finally xg — u> ■ vg. 

Since the line I is not on scale [n — 2] (as it is on scale [n — 1]) it follows from (6.3) that 

\\xt\-yJ^(e)\<2-^C , (6.11) 

5 As \a 2 - b 2 \ < c 2 implies \a - b\ < c for a,b,c> 0. 



l//e6braio/2008j 17:36 



21 



22: Degenerate elliptic resonances 



Therefore if (6.10) does not hold and if <j£ — 0, by the first part of the Diophantine conditions 



(6.7), one finds \\x e \ - y X ™ (e) | > C^ ml2 ^ {n ^ )l ' 1 for all m < n - 1 and for all 1 < i < d, 
which would be in contradiction with (6.11). 

If instead &i = 1 we shall use the second part of the Diophantine conditions (6.7) and get a 
contradiction. Remark that x can be assumed to be on scale [q] with q > n because of the cut-off 

functions in (6.5) so that one has ||x| — y^\'^(e)| < Cq2~ p for p < n — 1. Hence if xi satisfies 
(6.11) we get, by assuming that (6.10) does not hold, 



2 3 -"C > 
> 



Co Co , , 



- 2(™- 2 )/ 2 |^-i/| r i ~ 2( n - 2 )/ 2 |i/0| T i - °' 

for some T],T)e = ±1, which again leads to a contradiction, so that (6.10) holds also in such a case. 

Every node factor contributes to a factor /„„ bounded by F e^ K °^v\ - there are < (4d 2 ) k k\ 

self-energy clusters, 4 fe scales (for each line there are only two scales for which the propagator is 
not zero, and one has to allow also a scale different by one unit from that which corresponds to 
have a nonvanishing propagator, see Remark (3) after the inductive assumption), and N m {T) lines 
of scale m = 0,1, ... ,n in each self-energy cluster T contributing to M^- n \x;e) and not to the 
M^(x; e), with m < n. Thus the bound on MW(i; e) is 



Go ^£ fc Gte~^°^— 'e' G22 " /(2Tl) [] 2 2mA ^ T \ (6.13) 

fe=2 m=0 



for suitable constants Go,Gi,G2, explicitly computable by the above remarks; for k < 4 the expo- 
nent of e can be replaced by 2, see Remark after Definition 4. The estimate of the number M m {T) 
is given in Appendix A3 (cf. in particular Section A3. 4), and gives Af m (T) < B ra Euey(r) \ u v\> 
with E m = 2( 6 - m )/( 2r i) and ti in (3.4), which shows convergence of the series in (6.13) if £ is small 
enough, say e < e. 

We can and shall assume that e does not exceed min{ei , £2 , £3 } , with s\, £2 and £3 introduced 
earlier (see Remarks 3, 4, 7 after the inductive hypothesis). The rest of the argument repeats the 
analysis in Appendix A3 with minor notational changes: we only hint at the details in Appendix 
A3.4. 

Under the considered hypotheses the matrices M. (x; e) are well defined, by the above discussion 
on convergence of the defining series on the set n"TL 1 - Q _ 1 £ m . The symmetry in item (i) is due to 
algebraic identities valid for the Lindstedt series. They are detailed in Ref. [GG], Appendix A5, 
for £ < 0: being of algebraic nature the argument does not depend on the sign of e and it holds 
unchanged in the present case. 

The second and third lines of inequalities in (5.13) embody the cancellations. We need to check 
the cancellations to make sure for instance that the structure of the matrix M\ n \x;e) preserves 
the eigenvalues and the Whitney smoothness: a danger being that the first r eigenvalues become 
"detached" from 0, i.e. no longer can be bounded by ex 2 , see (5.14). For instance a bound like 
0{e 2 ) would not be enough as it would imply that the self-energies A^(e) may become different 
from zero for j < r. 
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Since the function M\ n ^ (x; e) is defined on the complement of a dense open set differentiability in 
the sense of Whitney can be proved (as usual) by computing a formal derivative and then showing 
that it is continuous and that it can also be used as a bound in interpolations. 6 

The computation of the formal derivatives proceeds as the computation of the actual derivatives 
done in the proof of Lemma 2 (in Appendix A3). One proves formal right and left continuous 
differentiability of the matrices M\ n \x;e) on the closed set H^Tj 1 - _ 1 £ m simply by differentiating 
term by term the value of each cluster contributing to M\ n \x;e). This involves differentiating 
matrices like (x 2 — M\- P \x; e))" 1 , i.e. the matrices M^(x; e) with p < n, which are differentiable 
by the inductive assumption, or it involves differentiating the cut-off functions ipp^Xp an d the 
locations A^'(e) with j > r (because ( £ ) = for j < r) which appear in the form A^(x,s) in 
the arguments of the cut-off functions. All such quantities are differentiable in e and right and left 
differentiable in x by the inductive assumption; furthermore all terms arising from differentiation 
either of A4^(x;e) or A^(e), with p < n, appear multiplied by some power of s, so that the 
inductive assumption is found to hold also for p = n (for a similar discussion see Ref. [Ge]). 

Note that A^(x; e) depend on j(x) but as e varies within the interval /, see (ii) in definition 1, 
j(x) is not only e-independent but it is also constant in x for x varying in small intervals near the 
eigenvalues of M and, therefore, in intervals widely spaced because n > n : this is due to the cut- 
off functions which force x to be close to a single eigenvalue if the propagator of the corresponding 
line is different from 0. Hence for n > tTq we do not have to differentiate the function j(x) (neither 
with respect to x nor with respect to e from which it does not depend); for n < no the function 
j(x) is constant to the right and to the left of every point. 

The n- independence of the constants A',A,B appearing in the inductive hypothesis is proved 
word by word as the corresponding argument in Appendix A3; the constant K\ has been estimated 
above (see G2 in (6.13)) and is n-independent. 

The interpolation bound, see footnote 6 , necessary for defining the Withney derivatives, holds 
because in comparing two contributions to M\ n \x;e) with different x or different e the difficulty 
might only come from the comparison of (x\ — M\- p ^ (xe, e)) -1 evaluated at two different points 
and for one line t at a time: this can be done algebraically by using the resolvent identity 

(x']- M^ix'^e^y 1 = [x\ - M l ^ p] (x t , e)) • (6.14) 

• (x'l - x 2 + M [ ^ p] (x' e ,e') - M [ ^ p] (x e ,ef) (x'l - ^^(xj.e 7 ))" 1 , 

which involves only denominators evaluated at x, e's which are in the set where they are controlled 
by the (6.7) and therefore can be estimated in the same way as the formal derivatives. The Withney 
extension is therefore possible keeping control of the bounds for all e's (small as above) and x. The 
dependence on x may involve the functions D{x) (for p < n — 1) so that the differentiability in x 
will be possible only to the right and to the left of each point (this involves a natural generalization 
of Whitney's theorem). 

The cancellations analysis (i.e. the proof of the second and third inequalities in (5.13)) is inductive 
and has been performed several times in the literature, see Refs. [Ga] and [GG]: in Appendix A3 

6 More precisely in its simplest form Whitney's theorem states that if F(x) is a function defined on a closed set C of 
the interval [0, 1] and if there is a continuous function F'(x) defined on C and if for some 7 > and all x,y S C one 
has \F{y) — F'(x)(y — x)\ < j\x — y\ (wc call this an interpolation bound) then there is a continously differentiable 
function F(x) extending F to [0, 1] and with derivative F (x), with max \ F (x)\ < 7, extending F'(x). 
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we have repeated it following the version in Rcf. [GM1] with some minor modifications. The same 
proof applies to the present case (being a purely algebraic check). 

The inequalities (5.13) imply the (5.14) and therefore we get differentiability of the matrices 
M\- n \x]e) and of the self-energies. This allows us to impose validity of (6.7) by excluding a few 
more values of e by Remark (1) to the inductive hypothesis. 

Therefore we conclude that M\ n \x;e) is defined and verifies (5.13) (with suitably chosen con- 
stants Ki,A',A, B) in the same domain s < e, where the matrix M\- P \x;e) is already defined 
for p < n — 1. Of course M\ n ^ will be relevant for our analysis only on the set (~]^ n= — o _ 1 £ m and 
the extension outside such set is only useful to simplify the analysis as it allows us to use freely 
interpolations formulae, mainly to check (6.7). The matrix M\- n ~ x ^ (x; e) verifies the inductive 
assumption although it has physical meaning only for e £ n^ ) =Wo _ 1 £'„, where £ n is the domain in 
which (6.7) holds for m < n. ■ 

Having checked that the series defining the A4^- n \x;e), hence the self-energies, converge and 
verify the bounds in the inductive hypothesis we still have to check that the fully renormalizcd 
series for h, which has thus been shown to make sense term by term, converges and that its sum is 
actually a function h satisfying the equations for the parametric representation of invariant tori. 

To study convergence we can take again advantage of the method, already used in the proof of 
Lemmas 2,3 above to estimate the number of lines on scale n in a self-energy cluster containing 
no self-energy clusters. Indeed also for renormalized trees one can prove a bound like Af m (0) < 
E m Soev(0) \ Vv I f° r ^0 number N m {9) of lines on scales m contained in A(8) with E m fast 

decreasing with m: E m d = 2( 6_m )/( 2ri ) (see Appendix A3). Hence convergence in the region £ £ 
n^L- o _ 1 £„ follows because if we only sum values of trees without self-energy clusters then we can 
use the above bound on Af m (9) . 

The renormalized trees may contain lines of scales [oo] which so far played no role. Their 
propagators are bounded below by a constant times e; however their number cannot be larger 
than ifc in trees of order k (see Remark (5) in Section 2). Therefore they may reduce the factor 
e k normally present in the value of a graph with k nodes to e? k ; hence this will not affect the 
convergence of the series other than by putting a more severe constant on the maximum value of 
e. 

The set complement of £ n in I c , has measure estimated by C 2 1~ n l 2 K for e £ [(^C) 2 , C' 2 ] = 
Ic- Since C = 2~™°Co and n > n — 1 > n this is a very small fraction of the interval Ic and the 
smaller the closer is Ic to 0. This means that the set of e's for which the whole construction can be 
performed has as a density point. Note that the resummation just defined is a real resummation 
of our series only for e £ r\^L- o _ 1 £ n , and there it gives a well defined function. 

The check that the functions h(tp) defined by the convergent renormalized series evaluated at 
ip = ut do actually solve the equations of motion can be performed by repeating the corresponding 
analysis in Ref. [Ge]. The equation that h = (a, b) has to solve is h = eg(d a f(ip + a, /3 + 
b), dgf(ij>+a, /3o+b)) where g is the pseudo-differential operator (u>-u)^ 2 . The proof is of algebraic 
nature and ultimately follows from the fact that the series we are considering is a resummation of 
Lindstedt's series which is a formal solution of the problem. This explains why the various algebraic 
identities necessary for the check actually hold and the proof proceeds exactly as in Section 8 of 
Ref. [Ge]: we reproduce the argument and the chain of identities in Appendix A5. Therefore the 
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proof of Theorem 1 in Section 1 is complete. 

7. Concluding remarks 

The analysis can be immediately extended to the case in which the matrix dpf (f3 ) has some 
non-degenerate positive eigenvalues and some additional negative ones. The negative eigenvalues 
give no problems and they can be treated as in the case of Ref. [GG] in which all eigenvalues are 
negative. The negative eigenvalues do not give rise to new small divisors, unlike the positive ones; 
in more physical language the proper time scales (i.e. real proper frequencies) of the tori cannot 
resonate with the time scales of hyperbolic type (i.e. imaginary) introduced by the perturbation. 
Hence the following generalization of Theorem 1 holds. 

Theorem 2. If the matrix dpfo(Po) * s n °t singular and has pairwise distinct eigenvalues the 
conclusions (i), (ii) and (Hi) of Theorem 1 in Section 1 follow also in this case. 

The present work has developed a combinatorial approach to the proof that the frequencies of 
elliptic type possibly introduced by the perturbation do not resonate with the proper frequencies 
of the tori at least if e is not too special in a small interval [0,e]. i.e. if it is in a set £ C [0,e] of 
large measure near 0: Nevertheless the complement of £ is an open dense set in [0,e]. The results 
hold for the Hamiltonian (1.2) and the special resonances (w, 0) considered: they can be extended 
to the most general resonances of Hamiltonians like (1.2) with a general quadratic form for the 
kinetic part (i.e. with |l • I replaced by |l • Ql with Q a non-degenerate dx d matrix). 

The case of 9g/o(A)) with degenerate eigenvalues seems quite different from the one treated here. 
Degeneracy will be removed to order 0(e 2 ) under generic conditions. However 0(e 2 ) is also the 
order of variation of the self-energies and one has to find a way to perform the resummations even 
between scale n and scale 2n , which is the scale at which the singularities of the propagator are 
split apart and one shall be able to proceed in the same way as we did in the case of non-degenerate 
eigenvalues. 

The Lipschitz regularity in s in Theorems 1 and 2 can be replaced by C k regularity for any k by 
exploiting the comments in Remark (2) to Lemma 2 and Remark (2) in Appendix A3. 2. 

Unfortunately there seems to be no example known in which one can check that the power 
series studied here are divergent as power series in e. Note that the (infinitely many) divergent 
series that have arisen in this paper are obtained by first splitting the coefficient of order k in the 
Lindstcdt power series and then collecting contributions from the different orders in e: the latter 
form divergent series for which we have assigned a summation rule. Therefore we have not proved 
divergence of the Lindstedt series as power series in s: in this sense (unlikely) convergence of the 
Lindstcdt series has not been ruled out (yet). Nor there is any uniqueness result on the value of the 
renormalized series. The latter depends on quite a few arbitrary choices (even in the hyperbolic 
cases): for instance the cut-off shapes in Fig. 2 are quite arbitrary and in principle the allowed e's 
will change with the choice. 

Furthermore, although we have not really checked all necessary details, it seems to us that our 
method also shows that, given a value £o for which the renormalized series converges, one can find a 
complex domain of s which is open, reaches the real axis with a vertical cusp at eo and extends to an 
open region including a segment (—77, 0) on the negative real axis. In this domain the renormalized 
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series should converge taking on the real axis real values parameterizing an hyperbolic torus with the 
same rotation vector. However since there are no uniqueness proofs we cannot guarantee that each 
such extension does not correspond to a different torus (close within any power of e to any other 
torus of the same type). This would signal a "giant bifurcation" that one would like to exclude: 
in Ref. [GG] an attempt was made to show uniqueness by estimating the size of the Lindstedt 
series coefficients aiming at applying the theory of Borel transforms. However we could not prove 
good enough bounds. We obtained k\ a growth with a too large a (given our estimated size of the 
domain of analyticity in e) to apply uniqueness results from the theory of Borel summations . 



The system which is usually studied in literature when the problem of persistence of lower- 
dimensional elliptic tori is studied, is of the form 



where (a,A,p, q) e T r x T x I s x I s . The function P is analytic in its arguments, and £ is 
a parameter in W ; the function P is a perturbation: this means that a rescaling of the actions 
could allow us to introduce a small parameter e in front of the function P. The frequencies of the 
harmonic oscillators are called normal frequencies; the case fi&(£) = fi/t = constant (that is with 
the normal frequencies independent of £) is a particular case, and it is usually referred to as the 
"constant frequency case". Existence of invariant tori for the system (Al.l) was originally proved 
by Mel'nikov [Mel], [Me2], new proofs were produced by Eliasson [El], Kuksin [Ku], and Poschel 
[PI]. The case s = 1 is easier, and it was earlier solved by Moser [Mo]. Later proofs were given by 
Russmann, see for instance Ref. [R] . See also the very recent Ref. [LW] . 

For P = the dimension of the tori is r < d and the variables (q, p) move around stable 
equilibrium points, hence such tori are called elliptic lower-dimensional tori. 

The conditions under which the quoted results are proved are, besides the usual Diophantine 
condition (1.3) on u>, two non-resonance conditions involving one and two normal frequencies (the 
so called first and second Mel'nikov conditions, originally introduced in Ref. [Mel]); in particular 
one has to impose that the normal frequencies are non-degenerate (i.e. they have to be all different 
from each other). 

Recently proofs of existence of elliptic lower-dimensional tori were given by requesting only 
the first Mel'nikov conditions: this allows treating degenerate frequencies. The first result in 
this direction is due to Bourgain [B3], where the ideas introduced in Refs. [CrW] and [Bl] to 
prove existence of periodic and quasi-periodic solutions in nearly integrable Hamiltonian partial 
differential equations were adapted to construct lower-dimensional tori in the finite-dimensional 
Hamiltonian systems (Al.l) corresponding to the case of constant normal frequencies. New proofs, 
extending the results also to the case of non-constant normal frequencies, are due to Xu and You 



An extension of the results of existence of periodic and quasi-periodic solutions describing lower- 
dimensional invariant tori for infinite-dimensional PDE systems has been provided in a series of 



papers, which include Refs. [Ku], [Wa], [CrW], [KP], [P2], [Bl], [B2], [B4], [GM2] and [GMP] . 



On the other hand the problem (1.2) has not been widely studied in literature. It corresponds 
to a degenerate case because in absence of perturbations the lower-dimensional tori are neither 



Appendix Al. A brief review of earlier results 



s 




(Al.l) 



[Y], [XY]. 
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elliptic nor hyperbolic: it is the perturbation itself which determines if the tori, when continuing 
to exist, become elliptic or hyperbolic (or of mixed type or parabolic). 

(i) The case of hyperbolic tori is easier, and it was the first to be studied, by Treshchev [T]. 
Recently the problem was reconsidered in Ref. [GG] , where the analyticity domain of the invariant 
tori was studied in more detail. In the case of elliptic tori the problem was considered in Rcfs. 
[ChW] and [WC], where Treshchev's approach to the study of the case of hyperbolic tori, involving 
a preliminary change of coordinates, is used to cast the Hamiltonian in a form which is suitable 
for applying Poschel's results on elliptic tori: in particular this imposes the same conditions as 
in Ref. [PI] on the normal frequencies which appear after the canonical change of coordinates is 
performed. 

(ii) The existence problem has been also considered in Ref. [JLZ], where elliptic and hyperbolic 
tori were studied simultaneously, again by imposing some non-degeneracy conditions on normal 
frequencies. Ref. [JLZ] does not investigate resummations of Lindstedt's series; it is based on a 
rapid convergence method, close in spirit to the original proofs of the KAM theorem: a concise 
existence proof of lower-dimensional tori is achieved in both the elliptic and hyperbolic cases. 

We stress that in all quoted papers, except Ref. [JLZ] and [T], the problem is considered with e 
(i.e. the size of the perturbation) fixed and the study deals with estimates of the measure of the 
rotation vectors u) for which there exist invariant tori. We suppose, instead, that u> is fixed, hence 
we study the dependence on e of the lower-dimensional invariant tori and, in particular, the set of 
values of e for which the tori survive. 

Our techniques extend those in Refs. [GG] and [Ge], and are based on the method introduced 
in Refs. [E2] and [Ga]. With respect to Ref. [Ge], where existence of quasi-periodic solutions is 
proved for the generalized Riccati equation considered in Ref. [Ba], the main difficulty is due to 
the presence of several normal frequencies. It is not surprising that this generates extra technical 
difficulties: as already noted, it is well known that the case s = 1 is easier; see Refs. [Mo] and 
[C]. An advantage of the present method is that it is fully constructive and gives a very detailed 
knowledge of the solution. 



Appendix A2. Excluded values of e 

Define 



Pwo-i = Jf min { min \d e y/^)\, min \d e y/$He) - d e y/^(e)\}, 

{A2.1) 



/sa s j 



and note that Pn -i is bounded from below proportionally to p, as defined in (4.2), and p^ _ 1 = 1. 
Then (3.3) excludes, for each v, an interval in e whose measure is bounded (using y/a s e < C; see 
(3.2)) by 

2-( n °- 1 >/ 2 £7C , o#bM~ T \ ( A2 - 2 ) 

where the constant K a can be estimated by K — s a J 1 v 
The Diophantine condition on w implies that if (3.3) is invalid then \u\ cannot be too small 

2^sa s p'- o _ 1 +2-^- 1 ^ 2 C \u\^ > \x\ >C \v\- T °. (A2.3) 
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Therefore ^Jea s p'- Q _ 1 > jCq\v\ t ° if no > 3, hence in this case we only have to consider the 

values of v with \v\ > (C /(4^eZp^~ 1 )) 1/ro ■ Since C/2 < y/ebS < C = 2-" C , we get 

the bound (3.5) with n = r + r + 1 and K = K C (AC^pL^ Cq 1 )^-^^ ^ = 

AK yj p/— a _ x Ylv^o [ITpFi • Note that a condition like T\ > r + r is sufficient to obtain both 
summability over v and a measure (of the excluded set) relatively small with respect to that of 
Ic- If n < 3, hence no < 3, the same conclusion trivially holds possibly increasing the value of K 
by a factor 4. 



Appendix A3. Resummations: convergence and smoothness 

To prove Lemma 2, we first show that the series defining M^- n \x;e) for < n < n converge 
and then we check smoothness and the bounds. This is done for completeness as the argument 
is almost a word by word repetition of the analysis in Rcf. [GG], with a few slight changes of 
notations necessary to adapt it to our present notations and scope. To study convergence of the 
series defining M^(x,e), n < no, we remark that we have to consider only trees in which all 
propagators have scales [p] with p < n - Therefore the propagators which do not vanish will be 
such that their denominators satisfy D(x) > 2~ 2 (™ +1 ) \x\ 2 , see (4.4), so that they are effectively 
estimated from below by \x\ 2 times a constant. Note that the case n — is obvious (and it is 
treated in Section 3). 

A3.1. Convergence. We suppose that the eigenvalues of M\- p ^ (x;e), n = 0, . . . , n — 1, differ from 
the corresponding ones of A^- l(x;e) = Mq so that |A^'(x,e) — A^°'| < 7s 2 for some 7 > 0, and 
that e is small enough so that 7s 2 < iea^ -2 " -2 and, therefore (see (4.4)), if a line with frequency 
x has scale [p], p < n, then \x 2 - A^ pI (x,e)| > 2- 2 ("+ 2 ).t 2 . 

We shall use that if a the propagator of a line is on a scale [n] then one has D(x) < 2~ 2 (™~ 2 )Cq , 
even though we could allow also a bound D(x) < 2 _2 (™ _1 'Cq . The reason for this is again for later 
use in bounds necessary to establish the needed cancellations as commented in Section A3. 2. 

Consider a renormalized self-energy cluster T e <5^ l n _ 1 , and define A m (T) = {£ e A(T) : ng = 
m}, for m < n — f , and "P(T) the set of lines (path) connecting the external lines of T. 

If v is the momentum flowing in the line entering T then the momentum flowing in a line £ G A(T) 
of scale \p], p < n — f , will be + agv, ag = 0, 1, where is the momentum that would flow on 
I if v = 0. The corresponding frequency will be x' t = x { \ + agx, with obvious notations. 

First of all we shall prove the bound 

l^ol > 2 ( ™^ 5)/T0 . (A3. 1) 

oev(T) 

for T e <S/?„_i- If there is a line I € A„_i(T) which docs not belong to V(T) then xg = x®, so 
that (A3.1) follows from the Diophantine condition on u>. If all lines in A„_i(T) belong to V(T) 
consider the one among them, say £, which is closest to l\, i.e. the entering line of T. Then call 
Ti the connected set of nodes and lines between 7 i and £\. If T\ is a single node then v v ^ 0, 

The lines between two lines l\ and £2 with £2 < £1 arc all the lines which precede £\ but which do not precede £2 
nor coincide with it. 



1/ /ebbraio/2008; 17:36 



28 



29: Degenerate elliptic resonances 

otherwise would be a trivial node; if T\ is not a single node then by construction all the lines of 
Ti have scales strictly smaller than n, hence xg ^ x otherwise T\ would be a self-energy cluster. In 
both cases one has \x£ — x\ = \x®\ > Cq\ X)oev(Ti) u v\~ Ta ■ On the other hand both D{x) and D(xe) 
must be < (C 2-("- 2 ) +1 ) 2 hence, by (4.4) \x\, \x e \ < G 2-"+"+ 3 , so that \x - x e \ < C 2- n +"+ 4 , 
and (A3.1) follows also in such a case. 

The next task will be to show that the number Af m (T) of lines on scale [to], with to < n — 1, 
contained in a cluster T is bounded by Af m (T) < max{E m X)oev(T) l^ol — 1^ 0}, with E rn = 
E2~ m / T ° for a suitably chosen constant E; as it will emerge from the proof one can take E = 
2 2(™+ 4 )/ T <\ 

Before considering clusters we adapt to our context the classical bound (Siegel-Bryuno-Poshel; 
see Ref. [Ga] and references quoted therein), stating that, if Af m (9) denotes the number of lines on 
scales [to], then by induction on the number of nodes of 9 one shows: N m {9) < max{E TO X^uev(e) \ v * 
1, 0}. Indeed if 9 contains only one node C and the frequency x = u> ■ v 0o of the root line has scale 
[m] one has 

2- m+1 C Q > ^D{xj>2-^ + ^\x\ > 2- {Tl+ ^C \v Va \- Ta |i/„ | > 2 {m -^- 2 ^ T0 , (A3.2) 

hence E m \i/ 0o \ — 1 > 2 and the bound holds in this simple case. 

If 6 has k nodes and the root line does not have scale [m] the inductive assumption, if it is 
assumed for the cases of k' < k nodes, gives the bound for fc-nodes trees. 

If the root line has scale [to] then on each path of tree lines leading to the root we select the line 
among the ones on scales [to'] with to' > to closest to the root (if any is found on the path) and 
we call the selected lines t\,...,l q . If q ^ 1 either the bound follows just as in the case of k = 1 
(when q = 0) or from the inductive hypothesis (when q > 2). 

The case q = 1 and [n^ 1 ] = [oo] (i.e. vt x = 0) can be treated as the case q = 0. If q = 1 
and vi x 7^ 0, by construction all lines between the root line I and £\, see footnote 7 , have scales 
[to'], with to' < to, so that such lines, together with the nodes they connect, form a cluster T. 
The frequencies xe and xg 1 must be different because the tree 9 contains no self-energy clusters. 
On the other hand y/D(x e ), \/D{x tl ) < 2- ,n+1 C , hence \x e \, \x tl \ < 2- m +™+ 3 C* by (4.4), and 
C Q \vt-v tl \-^ < \x e -x tl \ < 2-" l +"+ 4 C , so that we get E oe v(T) M > (2-" l+ "+ 4 )- 1 / T0 , which 

gives Af m {9) < 1 + £mEtjey(8) - E ™Y.x>ev(T) l^ol - 1 < E mY.x>ev(0) \ v * \ ~ 1 ' so that thc 
bounds is completely proved. 

Remark. The above discussion exploits thc property that the tree 9 that we consider cannot, 
by definition of renormalized tree, contain self-energy clusters, and follows Ref. [GG] which was 
based on the possibility of bounding the denominators proportionally to x 2 (in that case the 
proportionality factor was 1): a property also valid here for n < no- 

For thc bound on N m (T) we consider a subset Go of the lines of a tree 9 between two lines 
£ ou t and £i n . Set G = Go U /i n U £ ut- Let [p;„], [p ut] be the scales of the lines £ ou t and £- in , 
respectively, and suppose that pi n ,Pout > m, while all lines in Go (if any) have scales [p] with 
p < n — 1. Note that in general Go is not even a cluster unless Pi n ,Pout > n. Then we can prove 
that A/" m (Go) < max{E m J2v£V(G ) 1 1/0 1 — 1>0}> w hcrc F(Go) is the set of nodes preceding £ out 
and following and E m is defined above. If Go has zero lines then the harmonic uq of the (only) 
node in V(Gq) is large, \vq\ > 2( m_n_2 )/ T ° (by the Diophantine property) and the statement is 
true. Hence we proceed inductively on the number of lines in Gq. 
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If no line of Go on the path V(G) connecting the external lines of G has scale [to] then the lines 
in G on scale [to] (if any) belong to trees with root on V(G), and the statement follows from the 
bound on trees. 

Suppose that t G V(G) is a line on scale [to], then call G\ and Gi the disjoint subsets of G such 
that Gi U G 2 U I — G. Then G\ U I and G 2 U I have the same structure of G itself but each has 
less lines: and again the inductive assumption yields the result. 

Therefore, as a particular case, by choosing Go = T, with T e <5^ n _ 1 , the bound for Af m (G) 
implies the bound on J\f m (T) we are looking for. 

The above analysis is taken from Ref. [Ge] and differs from Ref. [GG] because here the scales 
depend on e and it is not clear how to define a "strong Diophantinc condition" , which would allow 
a one-to-one correspondence between line scales and line momenta. 

The bound on the contribution of a single self-energy cluster T e S]} n _ 1 is then 

k m o 

£ -C~ 2k F 2k e~^ Ko So ^ 1 ^ Y\ 2 (m+3)2 - /v ' m(T) ^ • (A3. 3) 



m=0 

.( e -^E, 6 v ( T,l^l f[ 2 2(m+3)2 ~ m/T0B £-v (T) l^l) < f^e-^E,^^!, 



m—niQ + l 

with F an upper bound on the constants Fq , F\ bounding the Fourier transform of the perturbation 
(see (1.4)), while to is defined so that log2^ m>mo 2(to + 3)2~ m / T °E < |«; and G is a suitable 
constant. 

The number of trees can be bounded by 4 fc fc!, and the sum over the scale labels involves at 
most 2 possible values per line because of the upper and lower cut-offs present in the propagators 
definition. The sum over the harmonics can be estimated by making use of part of the exponential 
factor in (A3. 3) (say \ko) while the other jKq will be used as a factor bounded by e~3 K ° 2< " ™ 5)/n> , 
by (A3.1). 

Hence we get convergence at exponential rate 2~ 1 for e < e\ (and e\ is an explicitly computable 
constant) and the matrix M^ n \x;e) is defined by a convergent series and it is bounded by 

\\MW(x;e)\\ <Bs 2 e--^ {n ^ l)/ ^ , (A3.4) 

for a suitable B which can be read from (A3. 3), i.e. we get the first of the first line in (5.13) with 
the constant B replaced by B, r = to, and k\ = jK e^ < - n+4 ^ T0 . The e 2 factor is due to the parallel 
remark that, in any self-energy cluster whose value contributes to M\ n \x;e), k is certainly > 2 
(see Remark to Definition 4 in Section 5). 
Therefore if e is small enough (that is smaller than a constant independent of n < no) 

oo 

\\M^ n \x-s)-M Q \\<Be 2 Y J ^ Ko2in ^ )/Ta = f B'e 2 , (A3.5) 

71=1 



so that the eigenvalues of M\- n ^ [x; e) will be shifted with respect to the corresponding eigenvalues 
of M by 7£ 2 at most, with 7 d = B'C, see (I) in Appendix A4. 

Hence if we define 7 as B'C and e is chosen small enough, say e < £2, so that 7s 2 < iea s 2~ 2 ™~ 2 
(as it must be in order that the above argument be consistent, see the beginning of the current 
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Section) we obtain the validity of the assumed inductive hypothesis for all n < no and of the first 
inequality in the first line of (5.13) where B can be chosen equal to B above. 

The symmetries in items (i) and (ii) are an algebraic consequence of the form of the Lindstedt 
series: hence they are a necessary consequence of the proved convergence, see Ref. [GG]. 

A3. 2. Smoothness. The function M^(x;e) which we have just shown to be well defined for all 
e small enough will be smooth in e,x. We assume inductively that this is the case for MH(i;£), 
< p < n — 1, and that the bounds in the first line of (5.13) hold for such p's (the case p = is 
obvious as M\°\x;e) = M ). 

Each derivative with respect to x or, respectively, to e will replace the value of a self-energy 
cluster with k nodes by a sum of k terms which can be bounded by a bound like (A3. 3). 

In fact, given a self-energy cluster T, the right derivative <9+ may fall on a denominator of one of 
the k — 1 cluster lines. If its frequency is x + Xq with scale label [m], derivation yields, up to a sign, 
a product of two matrices ((x + x) 2 - M^- m ^(x a + x; e)) _1 times 2 (x + x) — d^M^- m \x +x; e) 
with an appropriate order of multiplication. The term 2 (x + x a ) ((x + x) 2 — M\- m \x$ + x; e))~ 2 
can be bounded proportionally to (C ~ 2 2 2 ( m ~ 1 ' 1 ) 3 / 2 < (C _2 2 2 ( m_1 )) 2 , while the remaining term 
can be studied by making use of the inductive assumption ||9 a: Al'- m '(a;o + x; e)\\ < Be 2 a s ^ 2 and 
it leads to the same bound found for the first term, i.e. (Cq~ 2 2 2 (" 1-1 )) 2 , multiplied by Be 2 . 8 

If the derivative falls on either a ip p or a Xp function, we can use that such derivative can be 
bounded proportionally to C ~ 1 2 p and Yl™=o 2 P = 2 m , to obtain again the same bound as the first 
case. 

Hence the final bound has the form B\ + e 2 Bb with B\,b suitable constants, provided e is small 
enough, say e < £3. The value of the constants B\,b do not depend on the inductively assumed 
value for B: in particular B\ can be obtained (see Remark (2) below for a smarter bound) by 
replacing 2^ m+3 ' in the two factors in the l.h.s. of (A3. 3) by 2 2 ( m+3 ) and by inserting a factor k 
times a constant (to keep track of all the constant factors arising from differentiation). Therefore 
if B = 2Bi the estimate on <9+ 'M\- n \x; e) follows if e is small enough, say e < £4. The same can 
be said about the left derivative d~ . 

The right and left differentiability of M\ n \x;e) with respect to x is due to the dependence of 
A4^(x;e) on the function D(x): the latter has a discontinuous derivative at a finite number of 
points (roughly at midpoints between the eigenvalues A^°' of Mo). 9 Note that the denominators 
in the self-energy values defining M^(x;e) cannot vanish, and actually stay well away from 0, 
permitting the above bounds, because of the lower cut-off ipo(D(x)) appearing in the definition of 
the propagators (x; e)(x; e); see (5.6) and (5.7). 

The same argument holds for d e : however the bound will be only Be instead of Be 2 because 
the derivative with respect to e might decrease by one unit the degree of the self-energy values 
involved. Thus the first line of (5.13) is completely proved. Of course for each of the three terms 
we get a different constant _B, but for simplicity we use for them all the largest, still calling it B. 

Remark. (1) We could also prove existence of higher x, £-derivatives of M\ n \x\e) and of its 

8 Since the matrix M' m l (xq + x; e) is generated by self-energy clusters clusters of degree at least 2. 

9 One could avoid having only left and right differentiability by using a regularized version of the function D(x) as 
discussed in Remark (2) after Lemma 2 in Section 5. 
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eigenvalues A'"' (x, e) for j > r via the above argument. 

(2) The more derivatives we try to estimate with the above method the smaller would become the 
set of allowed values of e. This however is avoidable. Instead of imagining to include the bound 
Cq~ 2 2 2 " 1 arising above as a consequence of the "extra" D(x + xq) or of the other derivatives into 
the factors 2 m+3 associated with the divisors in (A3. 3) one could simply further bound this by 
Cq~ 2 2 2ti and use part of the factor e~i Ko2< " ]l (say replacing jKq with |«o) : this eventually 
leads to a bound on the s-th right-derivative with respect to a; of a value of a self-energy cluster 
proportional to 2™ s e~s K o 2( " " 4)/T ° but with an s-independent estimate of the radius of convergence 
(as the constant G in (A3. 3) remains the same). This is sufficient to get the existence of the s-th 
derivatives without any further restriction on s: and a similar argument holds for the ^-derivatives. 

A3. 3. Cancellations. Only the bound in the fourth line of (5.13) follows from those in the first 
line. The bounds in the second and third lines express remarkable properties of Lindstedt series 
and are essentially algebraic properties: they are the "same" cancellations which occur in KAM 
theory, see Refs. [Ga], [GM1], and are based on the remark that if T is a self-energy cluster the 
entering and exiting lines have the same momentum v. hence the sum of the harmonics of the 
nodes of T vanishes J2vev(T) v * = 0- 
We start by dealing with the trivial cases. Consider first self-energy clusters T such that 

£ Wx>\ > (q>/2 6 M) 1/ro - 043.6) 

OGV(T) 

For such a self-energy cluster T one can use part (say 1/8) of the exponential decay of the node 
factors to obtain a bound e ~~St>ev ( T) l"° < e -6i|xr 1/2T1 < b 2 x 2 , with b\ and b 2 two suitable 
positive constants, while a factor e 2 simply follows from the fact that any self-energy cluster has 
at least two nodes. 

So we can assume that (A3. 6) does not hold. If v is the momentum flowing in the entering line 
then the momentum flowing in a line I e A(T) of scale [p], p < n, if the scale of the cluster is 
[n], will be + atis, <ti = 0,1, where is the momentum that would flow on t if v = 0. The 
corresponding frequency will be xi — x® + cr^x, with obvious notations. 

If the entering and exiting lines are imagined attached to the internal nodes of T in all possible 
ways (i.e. in k 2 ways if T contains k nodes) keeping all their labels unaltered then one obtains a 
family Tt of self-energy clusters. The contribution of each self-energy cluster of Tt to each of the 
entries of the matrix M\ n ^(x;e) with labels i,j<r (the aa entries in the notations of Lemma 2) 
and with labels i < r,j > r (the a(3 entries) has the form M i} j. v> xo (x, T) v$ ti or, respectively, 
M/ j;0 (x,T) with 

M iM (x, T) = Mij (T) + xM$ (T) + .r 2 .!/,.%.„, (x, T), i, j < r, 

(-] \ ^-Ao. ( ) 

M( j;0 (x, T) — M- j (T) + xM!% (x, T) , i<r,r< j, 

so that after performing the sum over the self-energy clusters of Tt, he. after performing the sums 
So roev(T) or > respectively, X)oeV(T)' ^ ne ^ rs ^ ^ w0 terms in the first line and the first term in the 
second line do not contribute because J2 v> v =0. However one has to show that the matrices M 
and M' in the r.h.s. of (A3. 7) satisfy appropriate bounds once the factors x determining the order 
of zero at x = are extracted. From the convergence one expects that the bounds should still be 
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proportional to e 2 while the derivatives d^r or d E should satisfy bounds proportional to e 2 or to e 
respectively. 

The (A3. 7) are proved by means of interpolations, see [GM1], between the contributions of 
the self-energy clusters in the family JF T . When we collect together the values of the self-energy 
clusters in Tt then the arguments of some of the propagators can fall outside the supports of the 
respective cut-off function (because the lines are shifted but their scale labels are kept fixed so that 
scales of the propagators of the self-energy clusters T" <E Tt are the ones inherited by T while the 
momentum flowing in them may change). 

This generates trees and clusters for which we made no estimates (because they are just 0). 
However when interpolating we may end up computing values of trees, with scale assignments 
which would give a value 0, at intermediate frequencies where the values no longer vanish. In 
estimating such interpolated values we can proceed as in the cases already treated, but it will not be 
necessarily true that a line of frequency x and scale [n] will satisfy 2~ 2 (™~ 1 'Cq < D(x) < 2~ 2 ™Cq. 
Nevertheless a slightly weaker version of this inequality has to hold in which the the l.h.s. is 
divided by 4 and the r.h.s. is multiplied by 4 (cf. also Remark (3) after the inductive assumption 
in Section 6), and the estimates will not only be possible but they can be regarded as already 
obtained because, as the reader can check, we have been careful in discussing the bounds obtained 
so far under such weaker condition. This also clarifies why we have defined n in (4.2) one unit 
larger than what appeared there as necessary so that the estimate (4.4) is apparently worse than 
it should. 

In some cases, however, a serious problem seems to arise when actually attempting to derive 
bounds: namely the bounds on the matrices which appear as coefficients in (A3. 7) can really be 
checked as just outlined by the above hints, and without affecting the values of e for which one has 
convergence, only if x verifies the condition of being so small that the variations of the momenta 
flowing in the inner lines of T, when the entering or exiting lines are moved and re-attached to all 
nodes of T, remain so small that the quantities D{xi) corresponding to the lines t in the cluster T 
stay essentially unchanged. 

In certain cases shifting the entering or exiting lines to the nodes of the self-energy cluster T 
may considerably change the scales of the lines I in T, but this is the case in which (A3. 6) holds. 
And precisely in such a case the cancellations are not needed to prove the bound, because we have 
checked that the value of each self-energy cluster contributing to individually already verifies 
that bound that we want to prove. 

If (A3. 6) does not hold, then two cases are possible: cither |x| is close to A^ p ' for some j > r or 
larger, and no cancellation occurs, or \x\ is < Co2~ n . In the latter case the inequality opposite 
to (A3. 6) implies that for t e A(T) one has \x° t \ > 4\x\, hence 2\x° e \ > \x e \ > so that the 

scales can change by at most one unit by shifting the external lines of T. Then the quantities 
D(xg) do not change much for all lines £ G A(T), and we shall have the cancellation through the 
mentioned mechanism. Therefore the contribution of A4^(x;e) to M\- n ^{x;e) can be bounded 
in both cases proportionally to e~3 K ° 2< " " 4)/n> times min{e 2 ,e \x\ 2 } for the entries aa or times 
min{e 2 , £2 |x|} for the a(3 entries: either by the cancellation (second case) or by the general bound 
0(e 2 ) on matrix elements (first case), because x 2 is of order 0(e). 

Finally we note that in the estimates of the M's in (A3. 7) we have to sum over the scale labels 
and this gives a factor per line larger than the one arising in the bound (A3. 3) (which was 2); in 
fact we have to consider also trees with vanishing value: but the scales of the divisors associated 
with their lines can change at most by one unit with respect to the scale, hence we can have at 
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most 4 scale labels per line. 



Remark. We stress once more that the above analysis holds if e is small enough, say e < E\ with s\ 
determined by collecting all the (three) restrictions imposed by requiring e to be "small enough" , 
derived above and Si is independent of no (otherwise it would be uninteresting) . The reason is that 
as long as we do not deal with x's which are too close to the eigenvalues of M , so that the key 
inequality (4.4) holds, we do not really see the difference between the hyperbolic and the elliptic 
cases: and in the hyperbolic cases there is no need for a lower cut-off at scale ~ no where resonances 
between the proper frequencies (which are of order e) and the elliptic normal frequencies become 
possible (as e ~ C^ 2 ™ ). 

A3. 4. Resonant resummations. Concerning the proof of Lemma 6 we only need to add a few 
comments. The bounds on M m {6) and Af m (T) can be discussed exactly as for the scales [n] with 
n < no, with the only difference that now one has to use also the second part of the Diophantine 
conditions (6.7), as already done in the argument leading to (6.12); in particular the role of the 
exponent r is now played by 2t\ (because of the Diophantine conditions in (6.7) which replaces 
(1.3) in the discussion), while in the analogues of (A3.1) and the following bounds no n appear, 
as the propagator divisors are bounded directly in terms of the corresponding scales, and not in 
terms of the frequencies. 

Also the argument given above about the cancellations extends easily to the scales [n], with 
n > no- The only difference is that in (A3. 6) the exponent 1/tq has to be replaced with 1/(2ti), 

in such a way that for any line £ G A(T) one has \\x® \ — \JJ}^~^{e)\\ > A\x\, hence the chain of 
inequalities 
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(A3.8) 



follows, and again by shifting the external lines of T the scales of the internal lines can change at 
most by one unit, when (A3. 6) is not satisfied 



Appendix A4. Matrix properties 

(I) Let M be adxd Hermitian matrix with eigenvalues Ai < . . . < \ with multiplicities m, . . . , n p 
and eigenspaces Yl\, . . . ,U p on which we fix orthonormal bases ej k ,j = 1, . . . ,p, k — 1, . . . ,n,j. Let 
Mi be Hermitian and consider the matrix M = M + eM\ . There exists a constant C such that, for 
e small enough, there will be rij eigenvalues of M (not necessarily all distinct) which are analytic 
in e and one has \\j,k(e) — ^j,fe'( e )l — &e for k,k' = 1,.. . ,rij. 

Hint. If rij = 1 this follows immediately form the formula Xj(e) = Tr § y . z-li ) > wnere 7j i s a 
circle around Aj(0) of e-independent radius smaller than half the minimum separation 5 between 
the Xj for e small enough (so that C\£^ < S for a suitable C\) w . 

" Because the characteristic polynomials -P(A), Po(\) are related by P(\) = Po(X) + eQ(X, e) with Q of lower degree. 
Therefore there is L such that if |A| > L then for all |e| < 1 (say) it is P(X) 0. Furthermore if all roots of P differ 

by at least y from those of Pq one has |P(A)| > y d — eC d where C d = maxiA|<L,| e |<i |Q(A,e). Hence y < Ce d 1 . 
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Otherwise it follows form similar formulae for the projection operator Ej on IX,- and for EjMEj-. 

E . = — I EjMEj = — I Et-^-E*, (AAA) 

3 2niJ 7 . z-M' 3 3 2niJ 7 . 3 z - M 3: 1 ' 

which, for e small, can be expanded into a convergent power series in e (as done explicitly in a 
similar context in (A4.3) below) because of the e-independence of the radii of 7j. One can also 

construct an orthonormal basis on IL, with vectors of the form \rj_ k = ej.k + J2^>i £<le< j\l (shnphy 
applying the Hilbert-Schmidt orthonormalization to the vectors Ejejj,,k = 1, . . . ,n,j). One then 
remarks that the matrix EjMEj has nj eigenvalues and that it has the form A., + eM(e). 

So the problem is reduced to the case in which M is the identity perturbed by an analytic 
matrix. Either M(e) is proportional to the identity and there is nothing more to do, or it is not: 
hence there will be an order in e at which the degeneracy is removed and repeating the argument 
we reduce the problem to a similar one for matrices of dimension lower than nj\ and so on until 
we find a matrix (possibly one dimensional) proportional to the identity to all orders. ■ 

In our analysis we need the following corollary. 

(II) Let M be Hermitian with r degenerate eigenvalues equal to and s = d—r simple eigenvalues 
ea , j = l,...,s. 

(i) The matrix M + s 2 Mi with Mi Hermitian and differentiable in e with bounded derivative will 
have s non-degenerate eigenvalues eaj + 0{e 2 ), j = 1, . . . , s, and r eigenvalues Ai(e), . . . , X r (s), 
all analytic in e, with the property that for all k = 1, . . . ,r one has |Afe(e)| < C e 2 , if e is small 
enough and C is a suitable constant. 

(ii) If M\ depends on a parameter x and is differentiable also in x with bounded derivative then 

\d x Xj(x;e)\ <Ce 2 , \d £ Xj(x;s)\ < C, j > r, 
\Xj{x;e)-Xj{x';s)\ < Ce 2 \x - x'\ 1/r , j < r, 

if e is small enough and C is a suitable constant. 

The second relation in (A4.2) is not used in this paper and is given only for completeness. 

Hint. We apply the previous lemma to the matrices e~ 1 M and eM\ and we get (i). To get 
(A4.2) wc note that the x-derivative of M + e 2 M x is e 2 d x M 1 and the first of (A4.2) follows. To 
obtain the second of (A4.2) we have to compare the eigenvalues of M + e 2 Mi(x;e) with those of 
M + s 2 Mi(x; e) + e 2 0(\x — x'\). By the above expression for the projection on the plane of the 
first r eigenvalues this is reduced to the problem of comparing two r x r matrices of order e 2 and 
differing by 0(\x — x'\). The power 1/r arises from the estimate that the considered projection of 
the matrix M\ (which is only differentiable in x) has r eigenvalues close to within C\\x — x'\ 1/r , 
for some C\ > (by (I) above), and s is small enough. Hence we get the second of (A4.2). ■ 

A third property that we need is the following one. 

( e 2 x 2 N e 2 xP\ 

(III) If Mq is as in (II) and Mi is Hermitian and has the form I £ 2 x p* £ 2q J > ™^ ^ Q ^ wo 
r x r and r x s matrices and P ar x s matrix then the first r eigenvalues of Mq + Mi are bounded 
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by \Xj(x,s)\ < Ce 2 x 2 , for j = l,...,r. 



Hint. This is obtained by using (A4.1) which gives the projection over the plane of the r eigenvalues 
within 0(e 2 ) of as integral over a circle of radius |ai£ 



E = — !— (f — —ry ^(Mi \rr) k , (A4.3) 

2TTiJ 7 z-M ^ z-M > y ' 



and one sees that (Mi z _ 1 Mo ) fc has for all fc > 1 the same form of M\, with e 2 replaced by e 2 *, so 
that the sum of the series is the matrix ^ J jj ^ corresponding to the fc = term (it is a d x d 
block matrix with the first r x r block 1 and the other blocks 0) plus a matrix of the same form 

(£" 2 X 2 U/ 
2 / 
e xu h 

so that one checks that the matrix (v/,, (Mo + Mi)v/,/) is a r x r matrix which is proportional to 
e 2 x 2 (i.e. it has the form e 2 x 2 M2(x,e), with Mi bounded for e small and for |x| < 1) and which, 
by construction, has the same eigenvalues as the first r eigenvalues of the matrix M + M\. 

For the above properties see also [RS] and [Ka]. 
Appendix A5. Algebraic identities for the renormalized expansion 

We show that the function h defined through the renormalized expansion solves the equations 
of motion (1.5) for all e € £■ This is essentially a repetition of Ref. [Ge]. We shall check that 
h = egdipf(ip + a, /3o + b), where <p = (a, /3) and g is the pseudo-differential operator with kernel 
g(u> ■ u) = (uj ■ v)~ 2 . We can write h = J2 ue2i r e %u ^h. u , h„ = J2^=o ^n,v (only two terms in this 
series are different from for each v), with h„ „ = Y^k=i Seee TC (n) Val(0), where Oj^„(n) is the 
set of trees in such that the root line has scale n. With respect to the previous sections we 
have dropped the component label 7 € {1, . . . , d} in the definition of the set of trees, for notational 
convenience. 

Note that, for all x 7^ and for all p > one has 



1 = J2 (x, e)) J] x 9 (AM (x, e)), (45.1) 

n—p Q—P 

where the term with n = p has to be interpreted as ip p (A^(x] s)). 
Set * n (x;e) = ^„(AN (x; e)) UpZo Xp( aM (x; e)) for n > 1, * (z;e) - ^(AMfoe)): by using 
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(A5.1) one can write, in Fourier space and evaluating the functions of <p at <p = (xjj + a, /3 + 

oo 

g(u> ■ v) [ E d v f( V )] v = g(u ■ v) £ • v;e) [ed v f{<p)] v 

oo 

= g(u ■ v) J2 *n(« • i/; e)(.g W (« • i/; e))" 1 ^] (w • i/; e) [efy/M] „ 

n=0 

(A5.2) 

= • „) £ ((« • vf (« • i/; e)) ff W (« • i/; e) [efy/fa)] „ 

oo oo 

= g{u;-v)Y J ({"-v?-M^ n \u:-v;e))Y J E Val W' 



where fc ^(n) differs from &^ v {n) as it contains also trees which can have one renormalized self- 
energy cluster T with exiting line £q, if £q denotes the root line of 0; for such trees the line entering 
T will be on a scale p > 0, while the renormalized self-energy cluster T will have a scale nr = q, 
with q + 1 < min{n,p}. 

Remark. Note that in both (A5.1) and (A5.2) only a finite number of addends is different from zero, 
as the analysis of Section 6 shows, so that the two series are well defined. The same observation 
applies to the following formulae, where appear series which, in fact, are finite sums. 

By explicitly separating in (A5.2) the trees containing such self-energy clusters from the others, 

oo oo 

g(u> ■ v) [edrfto] „ = g(u> ■ v) ]T ((« • vf (« • v; e)) E E Val W 

n=o k=ieee^(n) 

oo 

+ g( u ■ v) ((« • vf - (w • i/; e)) (« • i/; e) (A5.3) 

71=1 

OO 71 — 1 OO 



EE^^-^E E Vai w 

p=7i q=0 fe=l 9eSf v (p) 

oo 

- • i/) E (V • iy ) 2 - Ml - n] ( w • £ )) ( w • v ; £ ) 



n=2 
n— 1 p— 1 



EE^^-^E E Vai w> 

p=i«=o fe=i eee j„(p) 
which, by the definitions of h, can be written as 

oo 

g(u> ■ v) [ed v f{ip)] „ = ■ i/) [ E ((" ■ - Ml - l] (« ' ^ £ )) h «,- ( A5 - 4 ) 

oo oo n—1 oo n-lp-1 

+ e *n(u ■ v- E ) e E mM ( w • v ; £ ) h f + E *»( w • £ ) E E mM ( w ■ v > £ )v- 

n=l p=n q=0 n=2 p=l q=0 
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The terms in the second line of (A5.4) can be written as 

oo p— 1 p p— 1 oo 

E(E E MM(« ■!/;£)*„(« ■!/;£) + 2 £ MM(« ■!/;£)*„(«■ i/)) 

p=l g= n— g+1 q^O n-p+1 

oo p— 1 oo 

= EE MM ( u, - I/ ' £ ) E *n(u>-^e)h PiI/ 

p— 1 g— n— g+1 

and, by changing p ^ n and n — > 5, we obtain 

oo n—1 



(w • i/;e)) • 

n=l 3=0 

oo 

• 2 x 9+ i(A l?+1, (^f;£))...i(A' s l( W . ! /;e)))h„ (A5.6) 

s=g+l 

oo n— 1 

= E E Ml9] ( w • ^ £ )xo(A [0] (« ■!/;£))... x,(AM (« • v- e)) h niI/ , 

n=l g=0 

where the identity (A5.1) has been used in the last line (with the correct interpretation of the term 
with s = j + 1 explained after (A5.1)). By the definition of the matrices M\- n ^{x; e) one has 

n-l 

(w • v- e) X o(A[°] (x; e)) . . . X ,(AM (a:; e)) - M ^ (x; e ), (A5.7) 

9=0 



so that, by inserting (A5.6) in (A5.3), after having used (A5.7), we obtain 

oo 

g(u ■ v) [ed v f(<p)] v = g(w ■ u) £ [ ((« • z/) 2 - • u;ej) +M^(u ■ u;e) 



n=0 



= g(u> ■ v) • i/) 2 h n) „ = ^2 h nM = h^, (A5.8) 

n=0 n=0 

so that the assertion is proved. 

Remark. Note that at each step only absolutely converging series have been dealt with, so that 
the above analysis is rigorous and not only formal. 
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